| Name | Accession | Description | Interval | E-value |
| RHS_core | NF041261 | RHS element core protein; | 1915-2719 | 2.25e-34 |
|
RHS element core protein;
Pssm-ID: 469161 [Multi-domain] Cd Length: 1261 Bit Score: 145.53 E-value: 2.25e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1915 SAVVDALGGTTAVRH----DNRGRQIAVVDAEGRASEMQYNTLDQ----LTRITLAAGTADAGQRTQTWDALGNKLSETD 1986
Cdd:NF041261 210 TGMVDRFGRTLTFHReaagDLAGEITGVTDGAGREFRLVLTTQAQraeeARKQRTSSLSSPDGPRPLSSSAFPDTLPGGT 289
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1987 EEGRTTSYQYDAM----GRVLRKSLPSGAIVT-TYDLLGN-----KTSETNLRG-------------------DKTTYAY 2037
Cdd:NF041261 290 EYGPDNGIRLSAVwlthDPAYPESLPAAPLVRyTYTEAGEllavyDRSNTQVRAftydaqhpgrmvahryagrPEMCYRY 369
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2038 DDANRLVLRTEPATPPKTTGYAYDGVgnitTETDALG-RQTTHTYNH--LNQRTATKFADGTTSTAVHDGNGNKTSETDA 2114
Cdd:NF041261 370 DDTGRVTEQLNPAGLSYRYQYEQDRI----TITDSLNrREVLHTEGEggLKRVVKKEHADGSVTRSGYDAAGRLTAQTDA 445
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2115 LGRVTTYVHDALNRLLSQ--TIAGRSKRsMVYDASGNLLSRTDANGNTSAFAYDALNRVVAETDALGRVTHTDYDKVGNK 2192
Cdd:NF041261 446 AGRRTEYSLNVVSGDITDitTPDGRETK-FYYNDGNQLTSVTSPDGLESRREYDEPGRLVSETSRSGETTRYRYDDPHSE 524
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2193 L--QVTNPLRQTQKWQYNARNWIVAQQDGEGHQTRYGHDKVGNRVTETWPNGNIVNFEYDALNRLIRSEDSIGLLGTTAY 2270
Cdd:NF041261 525 LpaTTTDATGSTKQMTWSRYGQLLAFTDCSGYQTRYEYDRFGQMTAVHREEGISTYRRYDNRGQLTSVKDAQGRETRYEY 604
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2271 DADGHITSQSDARGNATSFTWDAIGRQLSRSQptaaGNAVTSTVYDAAGNIVSVTTPGGNVITTKYDSRNRPIEILDSLG 2350
Cdd:NF041261 605 NAAGDLTAVITPDGNRSETQYDAWGKAVSTTQ----GGLTRSMEYDAAGRITTLTNENGSHSTFLYDALDRLVQQRGFDG 680
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2351 IASATTYDAVGNpLTQTDGRGRVLTHQYND---FNLRTATSDGLGQvgtVEYDLHGNKTGETD-ANGH--ATSYQYDALH 2424
Cdd:NF041261 681 RTQRYHYDLTGK-LTQSEDEGLVTLWHYDEsdrITHRTVNGEPAEQ---WQYDEHGWLTDISHlSEGHrvAVHYGYDDKG 756
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2425 RVIAttragiQLQKTEYDEAGRIQFEtdargNKVGYEYDKRGLLVK-TNRSLGAIDLLqrdSMGDVTLATDSEGRTTTTG 2503
Cdd:NF041261 757 RLTG------ERQTVENPETGELLWQ-----HETGHAYNEQGLANRvTPDSLPPVEWL---TYGSGYLAGMKLGGTPLVE 822
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2504 YDKRRraisvadgLGNTTHSTFDLAGnltetkAPNGATVSYAYDPANRLAtiTQSLDSGQAQATITYDTSGNLLeqRDLN 2583
Cdd:NF041261 823 YTRDR--------LHRETVRSFGGAG------SNAAYELTTAYTPAGQLQ--SQHLNSLVYDRDYTWNDNGDLV--RISG 884
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2584 GQSTR-HAYDARNRRirttlpatqagEAVQTNGYDNADGLTEHTDANGNRfvhtLDIRGRRTQTVTTASQGNGPGSVLQT 2662
Cdd:NF041261 885 PRQTReYGYSATGRL-----------TGVHTTAANLDIRIPYATDPAGNR----LPDPELHPDSTLTAWPDNRIAEDAHY 949
|
810 820 830 840 850 860
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2585425666 2663 TFGYDANGNLTSTAQTDSQGT------RTETTTYDAFNRPVKVTD-AWGNSLTHS---YDPQGNRIG 2719
Cdd:NF041261 950 VYRYDEYGRLTEKTDRIPEGVirtddeRTHHYHYDSQHRLVFYTRiQHGEPLVESrylYDPLGRRMA 1016
|
|
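Each hit's detail block carries a one-line summary of the form "Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...". The following is a minimal sketch, assuming the summary line keeps exactly that field order and that the bracketed hit type (for example "[Multi-domain]") may be absent; the function name `parse_summary` and the dictionary keys are hypothetical and not part of any NCBI tool.

```python
import re

# Field order follows the summary lines shown in this listing (an assumption
# about the format, not an official specification).
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r"(?:\s*\[(?P<hit_type>[^\]]+)\])?"   # e.g. [Multi-domain]; optional
    r"\s*Cd Length:\s*(?P<cd_length>\d+)"
    r"\s*Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s*E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "hit_type": m["hit_type"],          # None when no bracketed type is present
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


if __name__ == "__main__":
    line = ("Pssm-ID: 469161 [Multi-domain] Cd Length: 1261 "
            "Bit Score: 145.53 E-value: 2.25e-34")
    print(parse_summary(line))
```

Returning None for non-matching lines lets a caller run the function over every line of the listing and keep only the summaries.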
| RHS_core | NF041261 | RHS element core protein; | 2100-2886 | 3.03e-33 |
|
RHS element core protein;
Pssm-ID: 469161 [Multi-domain] Cd Length: 1261 Bit Score: 142.06 E-value: 3.03e-33
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2100 AVHDGNGNKTsetdalgRVTTYVHDALNRLLSQTIAGRSKRSMVYDASGNLLSRTDANGNTSAFAYDAlNRVVAeTDALG 2179
Cdd:NF041261 332 AVYDRSNTQV-------RAFTYDAQHPGRMVAHRYAGRPEMCYRYDDTGRVTEQLNPAGLSYRYQYEQ-DRITI-TDSLN 402
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2180 R--VTHTdydkvgnklqvtnplrqtqkwqynarnwivaqqDGEGhqtryghdkvgnrvtetwpngnivnfeydALNRLIR 2257
Cdd:NF041261 403 RreVLHT---------------------------------EGEG-----------------------------GLKRVVK 420
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2258 SEDSIGLLGTTAYDADGHITSQSDARGNATSFTWDAigrqlsrsqptaagnavtstvydAAGNIVSVTTPGGNVITTKYD 2337
Cdd:NF041261 421 KEHADGSVTRSGYDAAGRLTAQTDAAGRRTEYSLNV-----------------------VSGDITDITTPDGRETKFYYN 477
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2338 SRNRPIEILDSLGIASATTYDAVGNPLTQTDGRGRVLTHQYND--FNLRTATSDGLGQVGTVEYDLHGNKTGETDANGHA 2415
Cdd:NF041261 478 DGNQLTSVTSPDGLESRREYDEPGRLVSETSRSGETTRYRYDDphSELPATTTDATGSTKQMTWSRYGQLLAFTDCSGYQ 557
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2416 TSYQYDALHRVIATTR-AGIQLQKTeYDEAGRIQFETDARGNKVGYEYdkrgllvktnrslgaidllqrDSMGDVTLATD 2494
Cdd:NF041261 558 TRYEYDRFGQMTAVHReEGISTYRR-YDNRGQLTSVKDAQGRETRYEY---------------------NAAGDLTAVIT 615
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2495 SEGRTTTTGYDKRRRAISVADGlGNTTHSTFDLAGNLTETKAPNGATVSYAYDPANRLatITQSLDSGQAQaTITYDTSG 2574
Cdd:NF041261 616 PDGNRSETQYDAWGKAVSTTQG-GLTRSMEYDAAGRITTLTNENGSHSTFLYDALDRL--VQQRGFDGRTQ-RYHYDLTG 691
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2575 NLLEQRDlNGQSTRHAYDARNRRIRTTLpatqAGEAVQTNGYDNADGLTE--HTDANGNRFVH-TLDIRGRRT---QTVT 2648
Cdd:NF041261 692 KLTQSED-EGLVTLWHYDESDRITHRTV----NGEPAEQWQYDEHGWLTDisHLSEGHRVAVHyGYDDKGRLTgerQTVE 766
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2649 TASQgngpGSVL---QTTFGYDANGnlTSTAQTDSQGTRTETTTYDA--------FNRP-VKVT-DAWGNSLTHSYDPQG 2715
Cdd:NF041261 767 NPET----GELLwqhETGHAYNEQG--LANRVTPDSLPPVEWLTYGSgylagmklGGTPlVEYTrDRLHRETVRSFGGAG 840
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2716 NRIGTTAATASAPGGSVTTIEYDAL--NRRTSQSGAGGTTRISYDKSGRviqllhpdgsstSTRYDKAGRVAGETSStqa 2793
Cdd:NF041261 841 SNAAYELTTAYTPAGQLQSQHLNSLvyDRDYTWNDNGDLVRISGPRQTR------------EYGYSATGRLTGVHTT--- 905
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2794 tagAGNTLLDVAYTYDVNGNRIGSSRTESLSAA-----NRSAAlSAH-----------------LPGGSNSASHNRTrvE 2851
Cdd:NF041261 906 ---AANLDIRIPYATDPAGNRLPDPELHPDSTLtawpdNRIAE-DAHyvyrydeygrltektdrIPEGVIRTDDERT--H 979
|
810 820 830 840
....*....|....*....|....*....|....*....|
gi 2585425666 2852 SWTYDAQDRL-----TSHTTPERRTTWQLDAGGRRIQQNV 2886
Cdd:NF041261 980 HYHYDSQHRLvfytrIQHGEPLVESRYLYDPLGRRMAKRV 1019
|
|
| RHS_core | NF041261 | RHS element core protein; | 1901-2237 | 2.25e-27 |
|
RHS element core protein;
Pssm-ID: 469161 [Multi-domain] Cd Length: 1261 Bit Score: 122.80 E-value: 2.25e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1901 GNVTHMRYDARGMRSAVVDALGGTTAVR-HDNRGRQIAVVDAEGRASEMQYNTLDQLTRITlaagTADAGQRTQTWDALG 1979
Cdd:NF041261 426 GSVTRSGYDAAGRLTAQTDAAGRRTEYSlNVVSGDITDITTPDGRETKFYYNDGNQLTSVT----SPDGLESRREYDEPG 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1980 NKLSETDEEGRTTSYQYDAMGRVLRKSLPSGAIVT---TYDLLGNKTSETNLRGDKTTYAYDDANRL--VLRTEPATppk 2054
Cdd:NF041261 502 RLVSETSRSGETTRYRYDDPHSELPATTTDATGSTkqmTWSRYGQLLAFTDCSGYQTRYEYDRFGQMtaVHREEGIS--- 578
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2055 tTGYAYDGVGNITTETDALGRQTTHTYNHLNQRTATKFADGTTSTAVHDGNGNKTSETDalgrvttyvhdalnrllsqti 2134
Cdd:NF041261 579 -TYRRYDNRGQLTSVKDAQGRETRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQ--------------------- 636
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2135 aGRSKRSMVYDASGNLLSRTDANGNTSAFAYDALNRVVAETDALGR--------------------VTHTDYDKVGNKLQ 2194
Cdd:NF041261 637 -GGLTRSMEYDAAGRITTLTNENGSHSTFLYDALDRLVQQRGFDGRtqryhydltgkltqsedeglVTLWHYDESDRITH 715
|
330 340 350 360
....*....|....*....|....*....|....*....|....*.
gi 2585425666 2195 VTNPLRQTQKWQYNARNWIVA-QQDGEGHQ--TRYGHDKVGNRVTE 2237
Cdd:NF041261 716 RTVNGEPAEQWQYDEHGWLTDiSHLSEGHRvaVHYGYDDKGRLTGE 761
|
|
| RHS_core | NF041261 | RHS element core protein; | 1856-2616 | 2.76e-24 |
|
RHS element core protein;
Pssm-ID: 469161 [Multi-domain] Cd Length: 1261 Bit Score: 112.40 E-value: 2.76e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1856 DRRGLTSTASFDARGNLTGE--QMPDGSRISHSVAANGDRQSTTDVRGNVTHMRYDARGMR----SAVVDALGGTTAVRH 1929
Cdd:NF041261 214 DRFGRTLTFHREAAGDLAGEitGVTDGAGREFRLVLTTQAQRAEEARKQRTSSLSSPDGPRplssSAFPDTLPGGTEYGP 293
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1930 DNRGRQIAVVDAEGRASEMQYNTLdQLTRITLAAG--------TADAGQRTQTWDA-LGNKLSETDEEGRTTS-YQYDAM 1999
Cdd:NF041261 294 DNGIRLSAVWLTHDPAYPESLPAA-PLVRYTYTEAgellavydRSNTQVRAFTYDAqHPGRMVAHRYAGRPEMcYRYDDT 372
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2000 GRVLRKSLPSG---------AIVTTYDLLGNK-----TSETNLR---------GDKTTYAYDDANRLVLRTEPATppKTT 2056
Cdd:NF041261 373 GRVTEQLNPAGlsyryqyeqDRITITDSLNRRevlhtEGEGGLKrvvkkehadGSVTRSGYDAAGRLTAQTDAAG--RRT 450
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2057 GYAYDGV-GNITTETDALGRQTTHTYNHLNQRTATKFADGTTSTAVHDGNGNKTSETDALGRVTTYVHDALNRLLSQTI- 2134
Cdd:NF041261 451 EYSLNVVsGDITDITTPDGRETKFYYNDGNQLTSVTSPDGLESRREYDEPGRLVSETSRSGETTRYRYDDPHSELPATTt 530
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2135 -AGRSKRSMVYDASGNLLSRTDANGNTSAF---------------------AYDALNRVVAETDALGRVTHTDYDKVGNK 2192
Cdd:NF041261 531 dATGSTKQMTWSRYGQLLAFTDCSGYQTRYeydrfgqmtavhreegistyrRYDNRGQLTSVKDAQGRETRYEYNAAGDL 610
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2193 LQVTNPLRQTQKWQYNARNWIVAQQDGeGHQTRYGHDKVGNRVTETWPNGNIVNFEYDALNR------------------ 2254
Cdd:NF041261 611 TAVITPDGNRSETQYDAWGKAVSTTQG-GLTRSMEYDAAGRITTLTNENGSHSTFLYDALDRlvqqrgfdgrtqryhydl 689
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2255 ---LIRSEDSiGLLGTTAYDADGHITSQSDARGNATSFTWDAIGrQLSRSQPTAAGNAVTstvydaagnivsvttpggnv 2331
Cdd:NF041261 690 tgkLTQSEDE-GLVTLWHYDESDRITHRTVNGEPAEQWQYDEHG-WLTDISHLSEGHRVA-------------------- 747
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2332 ITTKYDSRNRpieildslgiaSATTYDAVGNPLTqtdgrGRVL-----THQYNDFNLRT-ATSDGLGQVGTVEYD---LH 2402
Cdd:NF041261 748 VHYGYDDKGR-----------LTGERQTVENPET-----GELLwqhetGHAYNEQGLANrVTPDSLPPVEWLTYGsgyLA 811
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2403 GNKTGETDanghATSYQYDALHRVIATTRAGIQL-----QKTEYDEAGRIQFE-TDARGNKVGYEYDKRGLLVKTNrslg 2476
Cdd:NF041261 812 GMKLGGTP----LVEYTRDRLHRETVRSFGGAGSnaayeLTTAYTPAGQLQSQhLNSLVYDRDYTWNDNGDLVRIS---- 883
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2477 aidllqrdsmgdvtlatdSEGRTTTTGYDKRRRAISVadglgNTTHSTFDLagnltetkapngaTVSYAYDPA-NRL--- 2552
Cdd:NF041261 884 ------------------GPRQTREYGYSATGRLTGV-----HTTAANLDI-------------RIPYATDPAgNRLpdp 927
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2553 -----ATITQSLD---SGQAQATITYDTSGNLLEQRDL---------NGQSTRHAYDARNRRIRTTlpATQAGEAVQTNG 2615
Cdd:NF041261 928 elhpdSTLTAWPDnriAEDAHYVYRYDEYGRLTEKTDRipegvirtdDERTHHYHYDSQHRLVFYT--RIQHGEPLVESR 1005
|
.
gi 2585425666 2616 Y 2616
Cdd:NF041261 1006 Y 1006
|
|
| RhsA | COG3209 | Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ... | 1739-2757 | 3.65e-23 |
|
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 108.69 E-value: 3.65e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1739 DGGHTAFAYNAKPEEPATDVTDARGKLTRYAFNKYGNPLSIAGPAGTTSMTWAVNDVLMLSKTDANGVVTSYTYDANGNQ 1818
Cdd:COG3209 1 ETSLGLVGGTTGASSTLLAATNAGGGTAVTNAGSTVLLAKGGLSTAAAAGGAATLTARSASTTDVVGTLTGAGGTSAGGV 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1819 TSEQVSGSGGTQAVSTSQTWLAQTAPPFIKNKRLSFTDRRGLTSTASFDARGNLTGEQMPDGSRISHSVAANGDRQSTTD 1898
Cdd:COG3209 81 TALGDASAAGGGYVGGAAAGGGATLTGLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTG 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1899 VRGNVTHMRYDARGMRSAVVDALGGTTAVRHDNRGRQIAVVDAEGRASEMQYNTLDQLTRITLAAGTADAGQRTQTWDAL 1978
Cdd:COG3209 161 LAGGGASAYGLTLGGAAAGPATGVGTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTG 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1979 GNKL-SETDEEGRTTSYQYDAMGRVLRKSLPSGAIVTTYDLLGNKTSETNLRGDKTTYAYDDANRLVLRTEPATPPKTTG 2057
Cdd:COG3209 241 SATGaAGAGAAVATAATTLGGTTGAGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGT 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2058 YAYDGVGNITTETDALGRQTTHTYNHLNQRTATKFADGTTSTAVHDGNGNKTSETDALGRVTTYVHDALNrllsqTIAGR 2137
Cdd:COG3209 321 TGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGT-----ATGSG 395
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2138 SKRSMVYDASGNLLSRTDANGNTSAFAYDALNRVVAETDALGRVTHTDYDKVGNKLQVTNPLRQTQKWQYNARNWIVAQQ 2217
Cdd:COG3209 396 GGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGG 475
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2218 DGEGHQTRYGHDKVGNRVTETWPNGNIVNFEYDALNRLIRSEDSIGLLGTTAYDADGHITSQSDARGNATSFTWDAIGRQ 2297
Cdd:COG3209 476 GTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVG 555
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2298 LSRSQPTAAGNAVTSTVYDAAGNIVSVTTPGGNVITTKYDSRNRPIEILDSLGIASATTYDAVGNPLTQTDGRGRVLTHQ 2377
Cdd:COG3209 556 TGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATA 635
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2378 YNDFNLRTATSDGLGQVGTVEYDLHGNKTGETDANGHATSYQYDALHRVIATTRAGIQLQKTEYDEAGRIQFETDARGNK 2457
Cdd:COG3209 636 STGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAGGT 715
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2458 VGYEYDKRGLLVKTNRSLGAIDLLQRDSMGDVTLATDSEGRTTTTGYDKRRRAISVADGLGN-----TTHSTFDLAGNLT 2532
Cdd:COG3209 716 TTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLTSETTPGGVtqgtyTTRYTYDALGRLT 795
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2533 ETKAPNGATVSYAYDPANRLATITQSLDSGQA---QATITYDTSGNLLEQRDlngqstrhaydarnrrirttlpATQAGE 2609
Cdd:COG3209 796 SVTYPDGETVTYTYDALGRLTSVITVGSGGGTdlqDRTYTYDAAGNITSITD----------------------ALRAGT 853
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2610 AVQTNGYDNADGLTEHTDANGnrfvhtldirgrrtqtvttasqgngpgsvlQTTFGYDANGNLTSTAQTDSQgtrteTTT 2689
Cdd:COG3209 854 LTQTYTYDALGRLTSATDPGT------------------------------TESYTYDANGNLTSRTDGGTT-----TYT 898
|
970 980 990 1000 1010 1020 1030
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2690 YDAFNRPVKVTDAWGNSLTHSYDPQG--NRIGTTAATASAPGGSVTTIEYDALNRRTSQSGAGGTTRISY 2757
Cdd:COG3209 899 YDALGRLVSVTKPDGTTTTYTYDALGhtDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSGAAANPLRF 968
|
|
| RHS_core | NF041261 | RHS element core protein; | 1657-2431 | 1.57e-21 |
|
RHS element core protein;
Pssm-ID: 469161 [Multi-domain] Cd Length: 1261 Bit Score: 103.54 E-value: 1.57e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1657 AFEYDA--AGNLVKITRTDAPEAteSYSYSDhtgplgmSNLLLSHTNALGQATQFKYHSG-----------PVLRQFGNG 1723
Cdd:NF041261 343 AFTYDAqhPGRMVAHRYAGRPEM--CYRYDD-------TGRVTEQLNPAGLSYRYQYEQDrititdslnrrEVLHTEGEG 413
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1724 QIpsfeSTVIGVTAADGGHTAFAYNAKPEEPATdvTDARGKLTRYAFNKY-GNPLSIAGPAGTTSMTWAVNDVLMLSKTD 1802
Cdd:NF041261 414 GL----KRVVKKEHADGSVTRSGYDAAGRLTAQ--TDAAGRRTEYSLNVVsGDITDITTPDGRETKFYYNDGNQLTSVTS 487
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1803 ANGVVTSYTYDANGNQTSEqVSGSGGTqavstsqtwlaqtappfiknKRLSFTDRRGLTSTASFDArgnltgeqmpDGSR 1882
Cdd:NF041261 488 PDGLESRREYDEPGRLVSE-TSRSGET--------------------TRYRYDDPHSELPATTTDA----------TGST 536
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1883 ISHSVAANGDRQSTTDVRGNVTHMRYDARGMRSAVVDALGGTTAVRHDNRGRQIAVVDAEGRASEMQYNTLDQLTritlA 1962
Cdd:NF041261 537 KQMTWSRYGQLLAFTDCSGYQTRYEYDRFGQMTAVHREEGISTYRRYDNRGQLTSVKDAQGRETRYEYNAAGDLT----A 612
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1963 AGTADAGQRTQTWDALGNKLSETdEEGRTTSYQYDAMGRVlrkslpsgaivttydllgnkTSETNLRGDKTTYAYDDANR 2042
Cdd:NF041261 613 VITPDGNRSETQYDAWGKAVSTT-QGGLTRSMEYDAAGRI--------------------TTLTNENGSHSTFLYDALDR 671
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2043 LVlrtepatppKTTGYaydgvgnittetdalgrqtthtynhlnqrtatkfaDGTTSTAVHDGNGNKTSETDAlGRVTTYV 2122
Cdd:NF041261 672 LV---------QQRGF-----------------------------------DGRTQRYHYDLTGKLTQSEDE-GLVTLWH 706
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2123 HDALNRLLSQTIAGRSKRSMVYDASGNL--LSRTdANGNTSAFAYDalnrvvaeTDALGRVThtdydkvGNKLQVTNPLR 2200
Cdd:NF041261 707 YDESDRITHRTVNGEPAEQWQYDEHGWLtdISHL-SEGHRVAVHYG--------YDDKGRLT-------GERQTVENPET 770
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2201 QTQKWQynarnwivaqqdgegHQTRYGHDKVG--NRVTE---------TWPNG----------NIVNFEYDALNRliRSE 2259
Cdd:NF041261 771 GELLWQ---------------HETGHAYNEQGlaNRVTPdslppvewlTYGSGylagmklggtPLVEYTRDRLHR--ETV 833
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2260 DSIGLLG-------TTAYDADGHITSQS-DARGNATSFTWDAIGRQLSRSQPtaagNAVTSTVYDAAGNIVSVTTPGGNv 2331
Cdd:NF041261 834 RSFGGAGsnaayelTTAYTPAGQLQSQHlNSLVYDRDYTWNDNGDLVRISGP----RQTREYGYSATGRLTGVHTTAAN- 908
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2332 ittkydsrnrpieildsLGIASATTYDAVGNPLTQTDgrgrvlthQYNDFNLRTATSDGLGQVG--TVEYDLHGNKTGET 2409
Cdd:NF041261 909 -----------------LDIRIPYATDPAGNRLPDPE--------LHPDSTLTAWPDNRIAEDAhyVYRYDEYGRLTEKT 963
|
810 820 830
....*....|....*....|....*....|.
gi 2585425666 2410 DA---------NGHATSYQYDALHRVIATTR 2431
Cdd:NF041261 964 DRipegvirtdDERTHHYHYDSQHRLVFYTR 994
|
|
| RhsA | COG3209 | Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ... | 1427-2380 | 4.76e-18 |
|
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 92.13 E-value: 4.76e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1427 NGYAFTLTAVSDKDGSVEPNQGLVISKLMLNDALPVGNILIQGVNVKSGRINLSGMGMGVAARGPQLALRPSYSSGGSGS 1506
Cdd:COG3209 14 SSTLLAATNAGGGTAVTNAGSTVLLAKGGLSTAAAAGGAATLTARSASTTDVVGTLTGAGGTSAGGVTALGDASAAGGGY 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1507 VGVLGVNWGHNFDASLSTTACGDILVNAGDGGFIRFLPQGNGTLTPAkGYHGTLIANNGDRSYDFYSKDGTRYHFGFIGG 1586
Cdd:COG3209 94 VGGAAAGGGATLTGLAAATASAGRLVSTGAGAGGTVTAATGGTLGAT-AGSATTGSTDGGRGGVAVTGLAGGGASAYGLT 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1587 KRQWALQSITDTNGNALTLTYDIGVDAPLLQVQNAYGQSLQFFYQTRAFVGSGAAVNVLQKVQGPEDMGLAFEYDAAGNL 1666
Cdd:COG3209 173 LGGAAAGPATGVGTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAV 252
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1667 VKITRTDAPEATESYSYSDHTGPLGMSNLLLSHTNALGQATQFKYHSGPVLRQFGNGQIPSFESTVIGVTAADGGHTAFA 1746
Cdd:COG3209 253 ATAATTLGGTTGAGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADA 332
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1747 YNAKPEEPATDVTDARGKLTRYAFNKYGNPLSIAGPAGTTSMTWAVNDVLMLSKTDANGVVTSYTYDANGNQTSEQVSGS 1826
Cdd:COG3209 333 GTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTST 412
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1827 GGTQAVSTSQTWLAQTAPPFIKNKRLSFTDRRGLTSTASFDARGNLTGEQMPDGSRISHSVAANGDRQSTTDVRGNVTHM 1906
Cdd:COG3209 413 TGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAG 492
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1907 RYDARGMRSAVVD-ALGGTTAVRHDNRGRQIAVVDAEGRASEMQYNTLDQLTRITLAAGTADAGQRTQTWDALGNKLSET 1985
Cdd:COG3209 493 ATTLGTDTTLDDTlGGTTTTTAGARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTG 572
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1986 DEEGRTTSYQYDAMGRVLRKSLPSGAIVTTYDLLGNKTSETNLRGDKTTYAYDDANRLVLRTEPATPPKTTGYAYDGVGN 2065
Cdd:COG3209 573 DGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTT 652
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2066 ITTETDALGRQTTHTYNHLNQRTATKFADGTTSTAVHDGNGNKTSETDALGRVTTYVHDALNRLLSQTIAGRSKRSMVYD 2145
Cdd:COG3209 653 GTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTT 732
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2146 ASGNLLSRTDANGNTSAFAYDALNRVVAETDALGRVThtdydkvgnklQVTNPLRQTQkwqynarnwivaqqdgEGHQTR 2225
Cdd:COG3209 733 DGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLT-----------SETTPGGVTQ----------------GTYTTR 785
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2226 YGHDKVGNRVTETWPNGNIVNFEYDALNRLIRSEDSIGLLGTT------AYDADGHITSQSDARGNA---TSFTWDAIGR 2296
Cdd:COG3209 786 YTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGGGTDlqdrtyTYDAAGNITSITDALRAGtltQTYTYDALGR 865
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2297 QLSRSQPTaagnAVTSTVYDAAGNIVSVTTPGGNVITtkYDSRNRPIEILDSLGIASATTYDA------VGNPLTQTDGR 2370
Cdd:COG3209 866 LTSATDPG----TTESYTYDANGNLTSRTDGGTTTYT--YDALGRLVSVTKPDGTTTTYTYDAlghtdhLGSVRALTDAS 939
|
970
....*....|
gi 2585425666 2371 GRVLTHQYND 2380
Cdd:COG3209 940 GQVVWRYDYD 949
|
|
| YebA | COG1305 | Transglutaminase-like enzyme, putative cysteine protease [Posttranslational modification, ... | 184-315 | 6.35e-14 |
|
Transglutaminase-like enzyme, putative cysteine protease [Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 440916 [Multi-domain] Cd Length: 174 Bit Score: 72.34 E-value: 6.35e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 184 ATGQAQGNAPVLRAAMLPVRPLGLAVRAPVSAPVILPSYeAGQEIAAMPQDVADAPEAPLNEEIVAKAKEL------DYD 257
Cdd:COG1305 1 LAGLVLAALLAALSGPLAPAPTGLLVTAGAGRGGGVASV-VPGGGTELLAGPGELLSASYDPELRALAAELtggattPYE 79
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2585425666 258 YVR-IYEYVRNGIRsewYS----GSTKGALGTLRTGAGNAVDQASLLVAMLRAAGAPARYVHG 315
Cdd:COG1305 80 KARaLYDWVRDNIR---YDpgstGVGTTALETLERRRGVCRDFAHLLVALLRALGIPARYVSG 139
|
|
| Transglut_core | pfam01841 | Transglutaminase-like superfamily; This family includes animal transglutaminases and other ... | 243-316 | 1.76e-12 |
|
Transglutaminase-like superfamily; This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that centre around conserved cysteine, histidine, and aspartate residues that form the catalytic triad in the structurally characterized transglutaminase, the human blood clotting factor XIIIa'. On the basis of the experimentally demonstrated activity of the Methanobacterium phage pseudomurein endoisopeptidase, it is proposed that many, if not all, microbial homologs of the transglutaminases are proteases and that the eukaryotic transglutaminases have evolved from an ancestral protease.
Pssm-ID: 376628 [Multi-domain] Cd Length: 108 Bit Score: 65.89 E-value: 1.76e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 243 LNEEIVAKAKElDYDYVR-IYEYVRNGIrseWYSGSTKG-----ALGTLRTGAGNAVDQASLLVAMLRAAGAPARYVHGV 316
Cdd:pfam01841 3 LADRITGGATD-PLEKARaIYDYVRKNI---TYDLPGRSpgdgdAEEFLFTGKGDCEDFASLFVALLRALGIPARYVTGY 78
|
|
| RHS_repeat | pfam05593 | RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... | 2144-2180 | 9.92e-08 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 50.29 E-value: 9.92e-08
10 20 30
....*....|....*....|....*....|....*..
gi 2585425666 2144 YDASGNLLSRTDANGNTSAFAYDALNRVVAETDALGR 2180
Cdd:pfam05593 1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
|
|
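The pairwise rows above (a `gi` query row over a `Cdd:` model row) can be compared column by column. The following is a minimal sketch of counting identical aligned columns, assuming that `-` marks a gap and that lowercase letters mark unaligned residues, as the listings here appear to use them; the function name `aligned_identity` is hypothetical. The two strings are copied from the pfam05593 alignment at query positions 2144-2180.

```python
def aligned_identity(query, subject):
    """Count (identical, aligned) columns in two equal-length alignment rows."""
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    matches = 0
    aligned = 0
    for q, s in zip(query, subject):
        # Skip gap columns and lowercase residues (assumed to be unaligned).
        if q == "-" or s == "-" or q.islower() or s.islower():
            continue
        aligned += 1
        if q == s:
            matches += 1
    return matches, aligned


if __name__ == "__main__":
    query = "YDASGNLLSRTDANGNTSAFAYDALNRVVAETDALGR"    # gi row, 2144-2180
    subject = "YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT"  # Cdd:pfam05593 row, 1-37
    matches, aligned = aligned_identity(query, subject)
    print(f"{matches}/{aligned} identical aligned columns")
```

This count is a plain column comparison and is unrelated to how the bit score or E-value in the summary line is computed.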
| YD_repeat_2x | TIGR01643 | YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ... | 2144-2185 | 2.26e-07 |
|
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Pssm-ID: 273728 [Multi-domain] Cd Length: 42 Bit Score: 49.13 E-value: 2.26e-07
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 2585425666 2144 YDASGNLLSRTDANGNTSAFAYDALNRVVAETDALGRVTHTD 2185
Cdd:TIGR01643 1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
|
|
| Big_1 | pfam02369 | Bacterial Ig-like domain (group 1); This family consists of bacterial domains with an Ig-like ... | 878-933 | 5.73e-06 |
|
Bacterial Ig-like domain (group 1); This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial surface proteins such as intimins and invasins involved in pathogenicity.
Pssm-ID: 460541 [Multi-domain] Cd Length: 64 Bit Score: 46.01 E-value: 5.73e-06
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*.
gi 2585425666 878 SVRVRDRDGRPVKGASVSFMAvrGGGSVSPATGTTNALGVASTTVTlgqSTLASSV 933
Cdd:pfam02369 10 TATVTDANGNPVPGATVTFSA--SGGTLSASSGTTDANGQATVTLT---STKAGTV 60
|
|
| TGc | smart00460 | Transglutaminase/protease-like homologues; Transglutaminases are enzymes that establish ... | 284-315 | 1.55e-05 |
|
Transglutaminase/protease-like homologues; Transglutaminases are enzymes that establish covalent links between proteins. A subset of transglutaminase homologues appear to catalyse the reverse reaction, the hydrolysis of peptide bonds. Proteins with this domain are both extracellular and intracellular, and it is likely that the eukaryotic intracellular proteins are involved in signalling events.
Pssm-ID: 214673 Cd Length: 68 Bit Score: 45.07 E-value: 1.55e-05
10 20 30
....*....|....*....|....*....|..
gi 2585425666 284 TLRTGAGNAVDQASLLVAMLRAAGAPARYVHG 315
Cdd:smart00460 1 LLKTKYGTCGEFAALFVALLRSLGIPARVVSG 32
|
|
| Bacuni_01323_like | cd12871 | Uncharacterized protein conserved in Bacteroidetes; A well-conserved family of 16-stranded ... | 1658-1779 | 2.17e-03 |
|
Uncharacterized protein conserved in Bacteroidetes; A well-conserved family of 16-stranded beta barrels resembling outer membrane porins. The interior of the barrels is mostly occupied by an insert with partially helical structure.
Pssm-ID: 214015 [Multi-domain] Cd Length: 231 Bit Score: 42.41 E-value: 2.17e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1658 FEYDAAGNLVKITRTDAPEATE-SYSYSDhtgplgmsNLLLSHT---NALGQATQFKYHSGPVLRQFGN-GQIPSFESTV 1732
Cdd:cd12871 95 FTYNADGQLTKIVESIGTEYSTiTITWNN--------GDIVSIStksNTEENESKITYTSDKVYNPIVNkGCLMLFGLTL 166
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 2585425666 1733 IgvtaADGGHTAFAYNAK---------PEEPATDVTDARGKLTrYAFNKYGNPLSI 1779
Cdd:cd12871 167 G----YDLSDLFYAYYAGllgkatkhlPESIIPKGNEETTTYT-YTFDKNGYPTSI 217
|
|
| PksD | COG3321 | Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ... | 134-635 | 2.56e-03 |
|
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];
Pssm-ID: 442550 [Multi-domain] Cd Length: 1386 Bit Score: 43.71 E-value: 2.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 134 SLPSSFEARRAAVQVQIDQLLQKLAAAMPGMDGDSQQKAQAVGALREALQATGQAQGNAPVLRAAMLPVRPLGLAVRAPV 213
Cdd:COG3321 862 PLPTYPFQREDAAAALLAAALAAALAAAAALGALLLAALAAALAAALLALAAAAAAALALAAAALAALLALVALAAAAAA 941
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 214 SAPVILPSYEAGQEIAAMPQDVADAPEAPLNEEIVAKAKELDYDYVRIYEYVRNGIRSEWYSGSTKGALGTLRTGAGNAV 293
Cdd:COG3321 942 LLALAAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAALALLAAAALLLAAAAAAAAL 1021
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 294 DQASLLVAMLRAAGAPARYVHGVAEIGVDGIASAAGLGDPGLVPEMLAKAGIAYSPVVQGGRVALVRMEHTWVAVQVPYT 373
Cdd:COG3321 1022 LALAALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALAELALAAAALALAAALAAAALALALAA 1101
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 374 NYRGIVLDASGKTWLPLDVFHktLQPRPAGAGLADLGLDLQQLAMQYRSKVQSMDFGSFVREQVDAALQPKSSSYEAAAA 453
Cdd:COG3321 1102 LAAALLLLALLAALALAAAAA--ALLALAALLAAAAAAAALAAAAAAAAALALAAAAAALAAALAAALLAAAALLLALAL 1179
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 454 PMPIRAQALGLLPNTLAFTVVAATAESAALPDAVRSTARLRLFNDATGAGEAGLDISLPVHELFNQRATINYIPAELADH 533
Cdd:COG3321 1180 ALAAALAAALAGLAALLLAALLAALLAALLALALAALAAAAAALLAAAAAAAALALLALAAAAAAVAALAAAAAALLAAL 1259
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 534 RAILLAGGL--DLAPLYLYQLRPELRLDGYQRKVGLAPLAGGSQVKFRLDIQNPANTQTVEQSFLVGAYHAIGVGQSGVA 611
Cdd:COG3321 1260 AALALLAAAagLAALAAAAAAAAAALALAAAAAAAAAALAALLAAAAAAAAAAAAAAAAAALAAALLAAALAALAAAVAA 1339
|
490 500
....*....|....*....|....
gi 2585425666 612 RSATPSARDGEYDAARLLDGIIQR 635
Cdd:COG3321 1340 ALALAAAAAAAAAAAAAAAAAAAL 1363
|
|
| YfaS |
COG2373 |
Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function ... |
876-941 |
6.29e-03 |
|
Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function prediction only];
Pssm-ID: 441940 [Multi-domain] Cd Length: 1605 Bit Score: 42.38 E-value: 6.29e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2585425666 876 SMSVRVRDR-DGRPVKGASVSFMAVRGggsVSPATGTTNALGVASTTVTLGQSTLASSVYVLVKPGD 941
Cdd:COG2373 277 GLLVFVTSLsTGKPVAGAEVELYDRNG---QVLATATTDADGLARFPAGDRGEGGRAPALLVARKGG 340
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| RHS_core |
NF041261 |
RHS element core protein; |
1915-2719 |
2.25e-34 |
|
RHS element core protein;
Pssm-ID: 469161 [Multi-domain] Cd Length: 1261 Bit Score: 145.53 E-value: 2.25e-34
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1915 SAVVDALGGTTAVRH----DNRGRQIAVVDAEGRASEMQYNTLDQ----LTRITLAAGTADAGQRTQTWDALGNKLSETD 1986
Cdd:NF041261 210 TGMVDRFGRTLTFHReaagDLAGEITGVTDGAGREFRLVLTTQAQraeeARKQRTSSLSSPDGPRPLSSSAFPDTLPGGT 289
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1987 EEGRTTSYQYDAM----GRVLRKSLPSGAIVT-TYDLLGN-----KTSETNLRG-------------------DKTTYAY 2037
Cdd:NF041261 290 EYGPDNGIRLSAVwlthDPAYPESLPAAPLVRyTYTEAGEllavyDRSNTQVRAftydaqhpgrmvahryagrPEMCYRY 369
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2038 DDANRLVLRTEPATPPKTTGYAYDGVgnitTETDALG-RQTTHTYNH--LNQRTATKFADGTTSTAVHDGNGNKTSETDA 2114
Cdd:NF041261 370 DDTGRVTEQLNPAGLSYRYQYEQDRI----TITDSLNrREVLHTEGEggLKRVVKKEHADGSVTRSGYDAAGRLTAQTDA 445
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2115 LGRVTTYVHDALNRLLSQ--TIAGRSKRsMVYDASGNLLSRTDANGNTSAFAYDALNRVVAETDALGRVTHTDYDKVGNK 2192
Cdd:NF041261 446 AGRRTEYSLNVVSGDITDitTPDGRETK-FYYNDGNQLTSVTSPDGLESRREYDEPGRLVSETSRSGETTRYRYDDPHSE 524
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2193 L--QVTNPLRQTQKWQYNARNWIVAQQDGEGHQTRYGHDKVGNRVTETWPNGNIVNFEYDALNRLIRSEDSIGLLGTTAY 2270
Cdd:NF041261 525 LpaTTTDATGSTKQMTWSRYGQLLAFTDCSGYQTRYEYDRFGQMTAVHREEGISTYRRYDNRGQLTSVKDAQGRETRYEY 604
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2271 DADGHITSQSDARGNATSFTWDAIGRQLSRSQptaaGNAVTSTVYDAAGNIVSVTTPGGNVITTKYDSRNRPIEILDSLG 2350
Cdd:NF041261 605 NAAGDLTAVITPDGNRSETQYDAWGKAVSTTQ----GGLTRSMEYDAAGRITTLTNENGSHSTFLYDALDRLVQQRGFDG 680
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2351 IASATTYDAVGNpLTQTDGRGRVLTHQYND---FNLRTATSDGLGQvgtVEYDLHGNKTGETD-ANGH--ATSYQYDALH 2424
Cdd:NF041261 681 RTQRYHYDLTGK-LTQSEDEGLVTLWHYDEsdrITHRTVNGEPAEQ---WQYDEHGWLTDISHlSEGHrvAVHYGYDDKG 756
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2425 RVIAttragiQLQKTEYDEAGRIQFEtdargNKVGYEYDKRGLLVK-TNRSLGAIDLLqrdSMGDVTLATDSEGRTTTTG 2503
Cdd:NF041261 757 RLTG------ERQTVENPETGELLWQ-----HETGHAYNEQGLANRvTPDSLPPVEWL---TYGSGYLAGMKLGGTPLVE 822
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2504 YDKRRraisvadgLGNTTHSTFDLAGnltetkAPNGATVSYAYDPANRLAtiTQSLDSGQAQATITYDTSGNLLeqRDLN 2583
Cdd:NF041261 823 YTRDR--------LHRETVRSFGGAG------SNAAYELTTAYTPAGQLQ--SQHLNSLVYDRDYTWNDNGDLV--RISG 884
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2584 GQSTR-HAYDARNRRirttlpatqagEAVQTNGYDNADGLTEHTDANGNRfvhtLDIRGRRTQTVTTASQGNGPGSVLQT 2662
Cdd:NF041261 885 PRQTReYGYSATGRL-----------TGVHTTAANLDIRIPYATDPAGNR----LPDPELHPDSTLTAWPDNRIAEDAHY 949
|
810 820 830 840 850 860
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2585425666 2663 TFGYDANGNLTSTAQTDSQGT------RTETTTYDAFNRPVKVTD-AWGNSLTHS---YDPQGNRIG 2719
Cdd:NF041261 950 VYRYDEYGRLTEKTDRIPEGVirtddeRTHHYHYDSQHRLVFYTRiQHGEPLVESrylYDPLGRRMA 1016
|
|
| RHS_core |
NF041261 |
RHS element core protein; |
2100-2886 |
3.03e-33 |
|
RHS element core protein;
Pssm-ID: 469161 [Multi-domain] Cd Length: 1261 Bit Score: 142.06 E-value: 3.03e-33
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2100 AVHDGNGNKTsetdalgRVTTYVHDALNRLLSQTIAGRSKRSMVYDASGNLLSRTDANGNTSAFAYDAlNRVVAeTDALG 2179
Cdd:NF041261 332 AVYDRSNTQV-------RAFTYDAQHPGRMVAHRYAGRPEMCYRYDDTGRVTEQLNPAGLSYRYQYEQ-DRITI-TDSLN 402
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2180 R--VTHTdydkvgnklqvtnplrqtqkwqynarnwivaqqDGEGhqtryghdkvgnrvtetwpngnivnfeydALNRLIR 2257
Cdd:NF041261 403 RreVLHT---------------------------------EGEG-----------------------------GLKRVVK 420
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2258 SEDSIGLLGTTAYDADGHITSQSDARGNATSFTWDAigrqlsrsqptaagnavtstvydAAGNIVSVTTPGGNVITTKYD 2337
Cdd:NF041261 421 KEHADGSVTRSGYDAAGRLTAQTDAAGRRTEYSLNV-----------------------VSGDITDITTPDGRETKFYYN 477
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2338 SRNRPIEILDSLGIASATTYDAVGNPLTQTDGRGRVLTHQYND--FNLRTATSDGLGQVGTVEYDLHGNKTGETDANGHA 2415
Cdd:NF041261 478 DGNQLTSVTSPDGLESRREYDEPGRLVSETSRSGETTRYRYDDphSELPATTTDATGSTKQMTWSRYGQLLAFTDCSGYQ 557
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2416 TSYQYDALHRVIATTR-AGIQLQKTeYDEAGRIQFETDARGNKVGYEYdkrgllvktnrslgaidllqrDSMGDVTLATD 2494
Cdd:NF041261 558 TRYEYDRFGQMTAVHReEGISTYRR-YDNRGQLTSVKDAQGRETRYEY---------------------NAAGDLTAVIT 615
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2495 SEGRTTTTGYDKRRRAISVADGlGNTTHSTFDLAGNLTETKAPNGATVSYAYDPANRLatITQSLDSGQAQaTITYDTSG 2574
Cdd:NF041261 616 PDGNRSETQYDAWGKAVSTTQG-GLTRSMEYDAAGRITTLTNENGSHSTFLYDALDRL--VQQRGFDGRTQ-RYHYDLTG 691
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2575 NLLEQRDlNGQSTRHAYDARNRRIRTTLpatqAGEAVQTNGYDNADGLTE--HTDANGNRFVH-TLDIRGRRT---QTVT 2648
Cdd:NF041261 692 KLTQSED-EGLVTLWHYDESDRITHRTV----NGEPAEQWQYDEHGWLTDisHLSEGHRVAVHyGYDDKGRLTgerQTVE 766
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2649 TASQgngpGSVL---QTTFGYDANGnlTSTAQTDSQGTRTETTTYDA--------FNRP-VKVT-DAWGNSLTHSYDPQG 2715
Cdd:NF041261 767 NPET----GELLwqhETGHAYNEQG--LANRVTPDSLPPVEWLTYGSgylagmklGGTPlVEYTrDRLHRETVRSFGGAG 840
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2716 NRIGTTAATASAPGGSVTTIEYDAL--NRRTSQSGAGGTTRISYDKSGRviqllhpdgsstSTRYDKAGRVAGETSStqa 2793
Cdd:NF041261 841 SNAAYELTTAYTPAGQLQSQHLNSLvyDRDYTWNDNGDLVRISGPRQTR------------EYGYSATGRLTGVHTT--- 905
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2794 tagAGNTLLDVAYTYDVNGNRIGSSRTESLSAA-----NRSAAlSAH-----------------LPGGSNSASHNRTrvE 2851
Cdd:NF041261 906 ---AANLDIRIPYATDPAGNRLPDPELHPDSTLtawpdNRIAE-DAHyvyrydeygrltektdrIPEGVIRTDDERT--H 979
|
810 820 830 840
....*....|....*....|....*....|....*....|
gi 2585425666 2852 SWTYDAQDRL-----TSHTTPERRTTWQLDAGGRRIQQNV 2886
Cdd:NF041261 980 HYHYDSQHRLvfytrIQHGEPLVESRYLYDPLGRRMAKRV 1019
|
|
| RHS_core |
NF041261 |
RHS element core protein; |
1901-2237 |
2.25e-27 |
|
RHS element core protein;
Pssm-ID: 469161 [Multi-domain] Cd Length: 1261 Bit Score: 122.80 E-value: 2.25e-27
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1901 GNVTHMRYDARGMRSAVVDALGGTTAVR-HDNRGRQIAVVDAEGRASEMQYNTLDQLTRITlaagTADAGQRTQTWDALG 1979
Cdd:NF041261 426 GSVTRSGYDAAGRLTAQTDAAGRRTEYSlNVVSGDITDITTPDGRETKFYYNDGNQLTSVT----SPDGLESRREYDEPG 501
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1980 NKLSETDEEGRTTSYQYDAMGRVLRKSLPSGAIVT---TYDLLGNKTSETNLRGDKTTYAYDDANRL--VLRTEPATppk 2054
Cdd:NF041261 502 RLVSETSRSGETTRYRYDDPHSELPATTTDATGSTkqmTWSRYGQLLAFTDCSGYQTRYEYDRFGQMtaVHREEGIS--- 578
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2055 tTGYAYDGVGNITTETDALGRQTTHTYNHLNQRTATKFADGTTSTAVHDGNGNKTSETDalgrvttyvhdalnrllsqti 2134
Cdd:NF041261 579 -TYRRYDNRGQLTSVKDAQGRETRYEYNAAGDLTAVITPDGNRSETQYDAWGKAVSTTQ--------------------- 636
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2135 aGRSKRSMVYDASGNLLSRTDANGNTSAFAYDALNRVVAETDALGR--------------------VTHTDYDKVGNKLQ 2194
Cdd:NF041261 637 -GGLTRSMEYDAAGRITTLTNENGSHSTFLYDALDRLVQQRGFDGRtqryhydltgkltqsedeglVTLWHYDESDRITH 715
|
330 340 350 360
....*....|....*....|....*....|....*....|....*.
gi 2585425666 2195 VTNPLRQTQKWQYNARNWIVA-QQDGEGHQ--TRYGHDKVGNRVTE 2237
Cdd:NF041261 716 RTVNGEPAEQWQYDEHGWLTDiSHLSEGHRvaVHYGYDDKGRLTGE 761
|
|
| RHS_core |
NF041261 |
RHS element core protein; |
1856-2616 |
2.76e-24 |
|
RHS element core protein;
Pssm-ID: 469161 [Multi-domain] Cd Length: 1261 Bit Score: 112.40 E-value: 2.76e-24
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1856 DRRGLTSTASFDARGNLTGE--QMPDGSRISHSVAANGDRQSTTDVRGNVTHMRYDARGMR----SAVVDALGGTTAVRH 1929
Cdd:NF041261 214 DRFGRTLTFHREAAGDLAGEitGVTDGAGREFRLVLTTQAQRAEEARKQRTSSLSSPDGPRplssSAFPDTLPGGTEYGP 293
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1930 DNRGRQIAVVDAEGRASEMQYNTLdQLTRITLAAG--------TADAGQRTQTWDA-LGNKLSETDEEGRTTS-YQYDAM 1999
Cdd:NF041261 294 DNGIRLSAVWLTHDPAYPESLPAA-PLVRYTYTEAgellavydRSNTQVRAFTYDAqHPGRMVAHRYAGRPEMcYRYDDT 372
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2000 GRVLRKSLPSG---------AIVTTYDLLGNK-----TSETNLR---------GDKTTYAYDDANRLVLRTEPATppKTT 2056
Cdd:NF041261 373 GRVTEQLNPAGlsyryqyeqDRITITDSLNRRevlhtEGEGGLKrvvkkehadGSVTRSGYDAAGRLTAQTDAAG--RRT 450
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2057 GYAYDGV-GNITTETDALGRQTTHTYNHLNQRTATKFADGTTSTAVHDGNGNKTSETDALGRVTTYVHDALNRLLSQTI- 2134
Cdd:NF041261 451 EYSLNVVsGDITDITTPDGRETKFYYNDGNQLTSVTSPDGLESRREYDEPGRLVSETSRSGETTRYRYDDPHSELPATTt 530
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2135 -AGRSKRSMVYDASGNLLSRTDANGNTSAF---------------------AYDALNRVVAETDALGRVTHTDYDKVGNK 2192
Cdd:NF041261 531 dATGSTKQMTWSRYGQLLAFTDCSGYQTRYeydrfgqmtavhreegistyrRYDNRGQLTSVKDAQGRETRYEYNAAGDL 610
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2193 LQVTNPLRQTQKWQYNARNWIVAQQDGeGHQTRYGHDKVGNRVTETWPNGNIVNFEYDALNR------------------ 2254
Cdd:NF041261 611 TAVITPDGNRSETQYDAWGKAVSTTQG-GLTRSMEYDAAGRITTLTNENGSHSTFLYDALDRlvqqrgfdgrtqryhydl 689
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2255 ---LIRSEDSiGLLGTTAYDADGHITSQSDARGNATSFTWDAIGrQLSRSQPTAAGNAVTstvydaagnivsvttpggnv 2331
Cdd:NF041261 690 tgkLTQSEDE-GLVTLWHYDESDRITHRTVNGEPAEQWQYDEHG-WLTDISHLSEGHRVA-------------------- 747
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2332 ITTKYDSRNRpieildslgiaSATTYDAVGNPLTqtdgrGRVL-----THQYNDFNLRT-ATSDGLGQVGTVEYD---LH 2402
Cdd:NF041261 748 VHYGYDDKGR-----------LTGERQTVENPET-----GELLwqhetGHAYNEQGLANrVTPDSLPPVEWLTYGsgyLA 811
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2403 GNKTGETDanghATSYQYDALHRVIATTRAGIQL-----QKTEYDEAGRIQFE-TDARGNKVGYEYDKRGLLVKTNrslg 2476
Cdd:NF041261 812 GMKLGGTP----LVEYTRDRLHRETVRSFGGAGSnaayeLTTAYTPAGQLQSQhLNSLVYDRDYTWNDNGDLVRIS---- 883
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2477 aidllqrdsmgdvtlatdSEGRTTTTGYDKRRRAISVadglgNTTHSTFDLagnltetkapngaTVSYAYDPA-NRL--- 2552
Cdd:NF041261 884 ------------------GPRQTREYGYSATGRLTGV-----HTTAANLDI-------------RIPYATDPAgNRLpdp 927
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2553 -----ATITQSLD---SGQAQATITYDTSGNLLEQRDL---------NGQSTRHAYDARNRRIRTTlpATQAGEAVQTNG 2615
Cdd:NF041261 928 elhpdSTLTAWPDnriAEDAHYVYRYDEYGRLTEKTDRipegvirtdDERTHHYHYDSQHRLVFYT--RIQHGEPLVESR 1005
|
.
gi 2585425666 2616 Y 2616
Cdd:NF041261 1006 Y 1006
|
|
| RhsA |
COG3209 |
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ... |
1739-2757 |
3.65e-23 |
|
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 108.69 E-value: 3.65e-23
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1739 DGGHTAFAYNAKPEEPATDVTDARGKLTRYAFNKYGNPLSIAGPAGTTSMTWAVNDVLMLSKTDANGVVTSYTYDANGNQ 1818
Cdd:COG3209 1 ETSLGLVGGTTGASSTLLAATNAGGGTAVTNAGSTVLLAKGGLSTAAAAGGAATLTARSASTTDVVGTLTGAGGTSAGGV 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1819 TSEQVSGSGGTQAVSTSQTWLAQTAPPFIKNKRLSFTDRRGLTSTASFDARGNLTGEQMPDGSRISHSVAANGDRQSTTD 1898
Cdd:COG3209 81 TALGDASAAGGGYVGGAAAGGGATLTGLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTG 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1899 VRGNVTHMRYDARGMRSAVVDALGGTTAVRHDNRGRQIAVVDAEGRASEMQYNTLDQLTRITLAAGTADAGQRTQTWDAL 1978
Cdd:COG3209 161 LAGGGASAYGLTLGGAAAGPATGVGTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTG 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1979 GNKL-SETDEEGRTTSYQYDAMGRVLRKSLPSGAIVTTYDLLGNKTSETNLRGDKTTYAYDDANRLVLRTEPATPPKTTG 2057
Cdd:COG3209 241 SATGaAGAGAAVATAATTLGGTTGAGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGT 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2058 YAYDGVGNITTETDALGRQTTHTYNHLNQRTATKFADGTTSTAVHDGNGNKTSETDALGRVTTYVHDALNrllsqTIAGR 2137
Cdd:COG3209 321 TGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGT-----ATGSG 395
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2138 SKRSMVYDASGNLLSRTDANGNTSAFAYDALNRVVAETDALGRVTHTDYDKVGNKLQVTNPLRQTQKWQYNARNWIVAQQ 2217
Cdd:COG3209 396 GGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGG 475
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2218 DGEGHQTRYGHDKVGNRVTETWPNGNIVNFEYDALNRLIRSEDSIGLLGTTAYDADGHITSQSDARGNATSFTWDAIGRQ 2297
Cdd:COG3209 476 GTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVG 555
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2298 LSRSQPTAAGNAVTSTVYDAAGNIVSVTTPGGNVITTKYDSRNRPIEILDSLGIASATTYDAVGNPLTQTDGRGRVLTHQ 2377
Cdd:COG3209 556 TGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATA 635
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2378 YNDFNLRTATSDGLGQVGTVEYDLHGNKTGETDANGHATSYQYDALHRVIATTRAGIQLQKTEYDEAGRIQFETDARGNK 2457
Cdd:COG3209 636 STGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAGGT 715
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2458 VGYEYDKRGLLVKTNRSLGAIDLLQRDSMGDVTLATDSEGRTTTTGYDKRRRAISVADGLGN-----TTHSTFDLAGNLT 2532
Cdd:COG3209 716 TTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLTSETTPGGVtqgtyTTRYTYDALGRLT 795
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2533 ETKAPNGATVSYAYDPANRLATITQSLDSGQA---QATITYDTSGNLLEQRDlngqstrhaydarnrrirttlpATQAGE 2609
Cdd:COG3209 796 SVTYPDGETVTYTYDALGRLTSVITVGSGGGTdlqDRTYTYDAAGNITSITD----------------------ALRAGT 853
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2610 AVQTNGYDNADGLTEHTDANGnrfvhtldirgrrtqtvttasqgngpgsvlQTTFGYDANGNLTSTAQTDSQgtrteTTT 2689
Cdd:COG3209 854 LTQTYTYDALGRLTSATDPGT------------------------------TESYTYDANGNLTSRTDGGTT-----TYT 898
|
970 980 990 1000 1010 1020 1030
....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2690 YDAFNRPVKVTDAWGNSLTHSYDPQG--NRIGTTAATASAPGGSVTTIEYDALNRRTSQSGAGGTTRISY 2757
Cdd:COG3209 899 YDALGRLVSVTKPDGTTTTYTYDALGhtDHLGSVRALTDASGQVVWRYDYDPFGNLLAETSGAAANPLRF 968
|
|
| RhsA |
COG3209 |
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ... |
1807-2791 |
1.41e-21 |
|
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 103.68 E-value: 1.41e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1807 VTSYTYDANGNQTSEQVSGSGGTQAVSTSQTWLAQTAPPFIKNKRLSFTDRRGLTSTASFDARGNLTGEQMPDGSRISHS 1886
Cdd:COG3209 6 LVGGTTGASSTLLAATNAGGGTAVTNAGSTVLLAKGGLSTAAAAGGAATLTARSASTTDVVGTLTGAGGTSAGGVTALGD 85
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1887 VAANGDRQSTTDVRGNVTHMRYDARGMRSAVVDALGGTTAVRHDNRGRQIAVVDAEGRASEMQYNTLDQLTRIT-LAAGT 1965
Cdd:COG3209 86 ASAAGGGYVGGAAAGGGATLTGLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTgLAGGG 165
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1966 ADAGQRTQTWDALGNKLSETDEEGRTTSYQYDAMGRVLRKSLPSGAIVTTYDLLGNKTSETNLRGDKTTYAYDDANRLVL 2045
Cdd:COG3209 166 ASAYGLTLGGAAAGPATGVGTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGA 245
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2046 RTEPATPPKTTGYAYDGVGNITTETDALGRQTTHTYNHLNQRTATKFADGTTSTAVHDGNGNKTSETDALGRVTTYVHDA 2125
Cdd:COG3209 246 AGAGAAVATAATTLGGTTGAGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAA 325
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2126 LNRLLSQTIAGRSKRSMVYDASGNLLSRTDANGNTSAFAYDALNRVVAETDALGRVTHTDYDKVGNKLQVTNPLRQTQKW 2205
Cdd:COG3209 326 VSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGA 405
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2206 QYNARNWIVAQQDGEGHQTRYGHDKVGNRVTETWPNGNIVNFEYDALNRLIRSedsiGLLGTTAYDADGHITSQSDARGN 2285
Cdd:COG3209 406 GTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDATTTTGGAGAS----GTLTTTGGAATGATTGGGTEAGT 481
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2286 ATSFTWDAIGRQLSRSQPTAAGNAVTSTVYDAAGNIVSVTTPGGNVITTKYDSRNRPIEILDSLGIASATTYDAVGNPLT 2365
Cdd:COG3209 482 GGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTG 561
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2366 QTDGRGRVLTHQYNDFNLRTATSDGLGQVGTVEYDLHGNKTGETDANGHATSYQYDALHRVIATTRAGIQLQKTEYDEAG 2445
Cdd:COG3209 562 TGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTT 641
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2446 RIQFETDARGNKVGYEYDKRGllvkTNRSLGAIDLLQRDSMGDVTLATDSEGRTTTTGYDKRRRAISVADGLGNTTHSTF 2525
Cdd:COG3209 642 GGTTGTGVTTTGTTTTRATGT----TGTGTGVTAGLTTLATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAGGTTT 717
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2526 DLAGNLTETKAPNGATVSYAYDPANRLATITQSLDSGQAQATITYDTSGNLLEQRDLNGQ-----STRHAYDARNRRIRT 2600
Cdd:COG3209 718 RLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLTSETTPGGVtqgtyTTRYTYDALGRLTSV 797
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2601 TLPAtqagEAVQTNGYDNADGLTEhtdangnrfvhtldirgrrtqtvTTASQGNGPGSVLQTTFGYDANGNLTSTAQTDS 2680
Cdd:COG3209 798 TYPD----GETVTYTYDALGRLTS-----------------------VITVGSGGGTDLQDRTYTYDAAGNITSITDALR 850
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2681 QGTRTETTTYDAFNRPVKVTDAWgNSLTHSYDPQGNRIgttaataSAPGGSVTTIEYDALNRRTS-QSGAGGTTRISYDK 2759
Cdd:COG3209 851 AGTLTQTYTYDALGRLTSATDPG-TTESYTYDANGNLT-------SRTDGGTTTYTYDALGRLVSvTKPDGTTTTYTYDA 922
|
970 980 990
....*....|....*....|....*....|....*....
gi 2585425666 2760 S------GRVIQLLHPDGSSTST-RYDKAGRVAGETSST 2791
Cdd:COG3209 923 LghtdhlGSVRALTDASGQVVWRyDYDPFGNLLAETSGA 961
|
|
| RHS_core |
NF041261 |
RHS element core protein; |
1657-2431 |
1.57e-21 |
|
RHS element core protein;
Pssm-ID: 469161 [Multi-domain] Cd Length: 1261 Bit Score: 103.54 E-value: 1.57e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1657 AFEYDA--AGNLVKITRTDAPEAteSYSYSDhtgplgmSNLLLSHTNALGQATQFKYHSG-----------PVLRQFGNG 1723
Cdd:NF041261 343 AFTYDAqhPGRMVAHRYAGRPEM--CYRYDD-------TGRVTEQLNPAGLSYRYQYEQDrititdslnrrEVLHTEGEG 413
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1724 QIpsfeSTVIGVTAADGGHTAFAYNAKPEEPATdvTDARGKLTRYAFNKY-GNPLSIAGPAGTTSMTWAVNDVLMLSKTD 1802
Cdd:NF041261 414 GL----KRVVKKEHADGSVTRSGYDAAGRLTAQ--TDAAGRRTEYSLNVVsGDITDITTPDGRETKFYYNDGNQLTSVTS 487
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1803 ANGVVTSYTYDANGNQTSEqVSGSGGTqavstsqtwlaqtappfiknKRLSFTDRRGLTSTASFDArgnltgeqmpDGSR 1882
Cdd:NF041261 488 PDGLESRREYDEPGRLVSE-TSRSGET--------------------TRYRYDDPHSELPATTTDA----------TGST 536
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1883 ISHSVAANGDRQSTTDVRGNVTHMRYDARGMRSAVVDALGGTTAVRHDNRGRQIAVVDAEGRASEMQYNTLDQLTritlA 1962
Cdd:NF041261 537 KQMTWSRYGQLLAFTDCSGYQTRYEYDRFGQMTAVHREEGISTYRRYDNRGQLTSVKDAQGRETRYEYNAAGDLT----A 612
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1963 AGTADAGQRTQTWDALGNKLSETdEEGRTTSYQYDAMGRVlrkslpsgaivttydllgnkTSETNLRGDKTTYAYDDANR 2042
Cdd:NF041261 613 VITPDGNRSETQYDAWGKAVSTT-QGGLTRSMEYDAAGRI--------------------TTLTNENGSHSTFLYDALDR 671
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2043 LVlrtepatppKTTGYaydgvgnittetdalgrqtthtynhlnqrtatkfaDGTTSTAVHDGNGNKTSETDAlGRVTTYV 2122
Cdd:NF041261 672 LV---------QQRGF-----------------------------------DGRTQRYHYDLTGKLTQSEDE-GLVTLWH 706
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2123 HDALNRLLSQTIAGRSKRSMVYDASGNL--LSRTdANGNTSAFAYDalnrvvaeTDALGRVThtdydkvGNKLQVTNPLR 2200
Cdd:NF041261 707 YDESDRITHRTVNGEPAEQWQYDEHGWLtdISHL-SEGHRVAVHYG--------YDDKGRLT-------GERQTVENPET 770
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2201 QTQKWQynarnwivaqqdgegHQTRYGHDKVG--NRVTE---------TWPNG----------NIVNFEYDALNRliRSE 2259
Cdd:NF041261 771 GELLWQ---------------HETGHAYNEQGlaNRVTPdslppvewlTYGSGylagmklggtPLVEYTRDRLHR--ETV 833
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2260 DSIGLLG-------TTAYDADGHITSQS-DARGNATSFTWDAIGRQLSRSQPtaagNAVTSTVYDAAGNIVSVTTPGGNv 2331
Cdd:NF041261 834 RSFGGAGsnaayelTTAYTPAGQLQSQHlNSLVYDRDYTWNDNGDLVRISGP----RQTREYGYSATGRLTGVHTTAAN- 908
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2332 ittkydsrnrpieildsLGIASATTYDAVGNPLTQTDgrgrvlthQYNDFNLRTATSDGLGQVG--TVEYDLHGNKTGET 2409
Cdd:NF041261 909 -----------------LDIRIPYATDPAGNRLPDPE--------LHPDSTLTAWPDNRIAEDAhyVYRYDEYGRLTEKT 963
|
810 820 830
....*....|....*....|....*....|.
gi 2585425666 2410 DA---------NGHATSYQYDALHRVIATTR 2431
Cdd:NF041261 964 DRipegvirtdDERTHHYHYDSQHRLVFYTR 994
|
|
| RhsA |
COG3209 |
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ... |
1547-2463 |
3.76e-21 |
|
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 102.14 E-value: 3.76e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1547 NGTLTPAKGYHGTLIANNGDRSYDFYSKDGTRYHFGFIGGKRQWALQSITDTNGNALTLTYDIGVDAPLLQVQNAYGQSL 1626
Cdd:COG3209 17 LLAATNAGGGTAVTNAGSTVLLAKGGLSTAAAAGGAATLTARSASTTDVVGTLTGAGGTSAGGVTALGDASAAGGGYVGG 96
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1627 QFFYQTRAFVGSGAAVNVLQKVQGPEDMGLAFEYDAAGNLVKITRTDAPEATESYSYSDHTGPLGMSNLLLSHTNALGQA 1706
Cdd:COG3209 97 AAAGGGATLTGLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGA 176
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1707 TQFKYHSGPVLRQFGNGQIPSFESTVIGVTAADGGHTAFAYNAKPEEPATDVTDARGKLTRYAFNKYGNPLSIAGPAGTT 1786
Cdd:COG3209 177 AAGPATGVGTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAA 256
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1787 SMTWAVNDVLMLSKTDANGVVTSYTYDANGNQTSEQVSGSGGTQAVSTSQTWLAQTAPPFIKNKRLSFTDRRGLTSTASF 1866
Cdd:COG3209 257 TTLGGTTGAGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTT 336
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1867 DARGNLTGEQMPDGSRISHSVAANGDRQSTTDVRGNVTHMRYDARGMRSAVVDALGGTTAVRHDNRGRQIAVVDAEGRAS 1946
Cdd:COG3209 337 TTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGD 416
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1947 EMQYNTLDQLTRITLAAGTADAGQRTQTWDALGNKLSETDEEGRTTSYQYDAMGRVLRKSLPSGAIVTTYDLLGNKTSET 2026
Cdd:COG3209 417 GGPATAAGALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTL 496
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2027 NLRGDKTTYAYDDANRLVLRTEPATPPKTTGYAYDGVGNITTETDALGRQTTHTYNHLNQRTATKFADGTTSTAVHDGNG 2106
Cdd:COG3209 497 GTDTTLDDTLGGTTTTTAGARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTG 576
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2107 NKTSETD-ALGRVTTYVHDALNRLLSQTIAGRSKRSMVYDASGNLLSRTDANGNTSAFAYDALNRVVAETDALGRVTHTD 2185
Cdd:COG3209 577 GASTTTGtTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTT 656
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2186 YDKVGNKLQVTNPLRQTQKWQYNARNWIVAQQDGEGHQTRYGHDKVGNRVTETWPNGNIVNFEYDALNRLIRSEDSIGLL 2265
Cdd:COG3209 657 TRATGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTG 736
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2266 GTTAYDADGHITSQSDARGNATSFTWDAIGRQLSRSQP--TAAGNAVTSTVYDAAGNIVSVTTPGGNVITTKYDSRNRPI 2343
Cdd:COG3209 737 TGGTTGTLTTTSTTTTTTAGALTYTYDALGRLTSETTPggVTQGTYTTRYTYDALGRLTSVTYPDGETVTYTYDALGRLT 816
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2344 EILDSLGIASAT------TYDAVGNPLTQTDGRGR---VLTHQYNDFNlRTATSDGLGQVGTVEYDLHGNKTGETDANGh 2414
Cdd:COG3209 817 SVITVGSGGGTDlqdrtyTYDAAGNITSITDALRAgtlTQTYTYDALG-RLTSATDPGTTESYTYDANGNLTSRTDGGT- 894
|
890 900 910 920 930
....*....|....*....|....*....|....*....|....*....|....*.
gi 2585425666 2415 aTSYQYDALHRVIATTRAGIQLQKTEYDEA------GRIQFETDARGNKVG-YEYD 2463
Cdd:COG3209 895 -TTYTYDALGRLVSVTKPDGTTTTYTYDALghtdhlGSVRALTDASGQVVWrYDYD 949
|
|
| RhsA |
COG3209 |
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction ... |
2015-2871 |
4.31e-21 |
|
Uncharacterized conserved protein RhsA, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 101.76 E-value: 4.31e-21
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2015 TYDLLGNKTSETNLRGDKTTYAYDDANRLVLRTEPATPPKTTGYAYDGVGNITTETDALGRQTTHTYNHLNQRTATKFAD 2094
Cdd:COG3209 35 TVLLAKGGLSTAAAAGGAATLTARSASTTDVVGTLTGAGGTSAGGVTALGDASAAGGGYVGGAAAGGGATLTGLAAATAS 114
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2095 GTTSTAVHDGNGNKTSETDALGRVTTYVHDALNRLLSQTIAGRSKRSMVYDASGNLLSRTDANGNTSAFAYDALNRVVAE 2174
Cdd:COG3209 115 AGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGTGAVTLATG 194
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2175 TDALGRVTHTDYDKVGNKLQVTNPLRQTQKWQYNARNWIVAQQDGEGHQTRYGHDKVGNRVTETWPNGNIVNFEY----- 2249
Cdd:COG3209 195 LAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGAsgagl 274
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2250 DALNRLIRSEDSIGLLGTTAYDADGHITSQSDARGNATSFTWDAIGRQLSRSQPTAAGNAVTSTVYDAAGNIVSVTTPGG 2329
Cdd:COG3209 275 DASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGS 354
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2330 NVITTKYDSRNRPIEILDSLGIASATTYDAVGNPLTQTDGRGRVLTHQYNDFNLRTATSDGLGQVGTVEYDLHGNKTGET 2409
Cdd:COG3209 355 LTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATG 434
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2410 DANGHATSYQYDALHRVIATTRAGIQLQKTEYDEAGRIQFETDARGNKVGYEYDKRGLLVKTNRSLGAIDLLQRDSMGDV 2489
Cdd:COG3209 435 TGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTA 514
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2490 TLATDSEGRTTTTGYDKRRRAISVADGLGNTTHSTFDLAGNLTETKAPNGATVSYAYDPANRLATITQSLDSGQAQATIT 2569
Cdd:COG3209 515 GARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTV 594
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2570 YDTSGNLLEQRDLNGQSTRHAYDARNRRIRTTLPATQAGEAVQTNGYDNADGLTEHTDANGNRFVHTLDIRGRRTQTVTT 2649
Cdd:COG3209 595 TTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLT 674
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2650 ASQGNGPGSVLQTTFGYDANGNLTSTAQTDSQGTRTETTTYDAFNRPVKVTDAWGNSLTHSYDPQGNRIGTTAATASAPG 2729
Cdd:COG3209 675 TLATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTT 754
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2730 GSVTTIEYDALNRRTSQSGAGG------TTRISYDKSGRVIQLLHPDGSSTSTRYDKAGRVageTSSTQATAGAGNTLLD 2803
Cdd:COG3209 755 AGALTYTYDALGRLTSETTPGGvtqgtyTTRYTYDALGRLTSVTYPDGETVTYTYDALGRL---TSVITVGSGGGTDLQD 831
|
810 820 830 840 850 860
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 2585425666 2804 VAYTYDVNGNRIGSSRTESLSAANRSAALSAHlpgGSNSASHNRTRVESWTYDAQDRLTSHTTPERRT 2871
Cdd:COG3209 832 RTYTYDAAGNITSITDALRAGTLTQTYTYDAL---GRLTSATDPGTTESYTYDANGNLTSRTDGGTTT 896
|
|
| RhsA | COG3209 | Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ... | 1534-2435 | 4.14e-19 |
|
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 95.59 E-value: 4.14e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1534 AGDGGFIRFLPQGNGTLTPAKGYHGTLIANNGDRSYDFYSKDGTRYHFGFIGGKRQWALQSITDTNGNALTLTYDIGVDA 1613
Cdd:COG3209 47 AAAGGAATLTARSASTTDVVGTLTGAGGTSAGGVTALGDASAAGGGYVGGAAAGGGATLTGLAAATASAGRLVSTGAGAG 126
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1614 PLLQVQNAYGQSLQFFYQTRAFVGSGAAVNVLQKVQGPEDMGLAFEYDAAGNLVKITRTDAPEATESYSYSDHTGPLGMS 1693
Cdd:COG3209 127 GTVTAATGGTLGATAGSATTGSTDGGRGGVAVTGLAGGGASAYGLTLGGAAAGPATGVGTGAVTLATGLAGSALLALGSG 206
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1694 NLLLsHTNALGQATQFKYHSGPVLRQFGNGQIPSFESTVIGVTAADGGHTAFAYNAKPEEPATDVTDARGKLTRYAFNKY 1773
Cdd:COG3209 207 AILG-GLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAVATAATTLGGTTGAGTGASGAGLDASTGTGGAGG 285
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1774 GNPLSIAGPAGTTSMTWAVNDVLMLSKTDANGVVTSYTYDANGNQTSEQVSGSGGTQAVSTSQTWLAQTAPPFIKNKRLS 1853
Cdd:COG3209 286 SNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGG 365
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1854 FTDRRGLTSTASFDARGNLTGEQMPDGSRISHSVAANGDRQSTTDVRGNVTHMRYDARGMRSAVVDALGGTTAVRHDNRG 1933
Cdd:COG3209 366 LTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAG 445
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1934 RQIAVVDAEGRASEMQYNTLDQLTRITLAAGTADAGQRTQTWDALGNKLSETDEEGRTTSYQYDAMGRVLRKSLPSGAIV 2013
Cdd:COG3209 446 TDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARGLVVTTGT 525
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2014 TTYDLLGNKTSETNLRGDKTTYAYDDANRLVLRTEPATPPKTTGYAYDGVGNITTETDALGRQTTHTYNHLNQRTATKFA 2093
Cdd:COG3209 526 TLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGT 605
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2094 DGTTSTAVHDGNGNKTSETDALGRVTTYVHDALNRLLSQTIAGRSKRSMVYDASGNLLSRTDANGNTSAFAYDALNRVVA 2173
Cdd:COG3209 606 TTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGG 685
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2174 ETDALGRVTHTDYDKVGNKLQVTNPLRQTQKWQYNARNWIVA-QQDGEGHQTRYGHDKVGNRVTETWPNGNIVNFEYDAL 2252
Cdd:COG3209 686 GTGTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGgGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDAL 765
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2253 NRLIRSEDSIGLLG-----TTAYDADGHITSQSDARGNATSFTWDAIGRQLSRSQPTAAGNAVTST---VYDAAGNIVSV 2324
Cdd:COG3209 766 GRLTSETTPGGVTQgtyttRYTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGGGTDLQDrtyTYDAAGNITSI 845
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2325 T---TPGGNVITTKYDSRNRPIEILDSLGIASaTTYDAVGNPLTQTDGRGRVLThqYNDFNLRTATSDGLGQVGTVEY-- 2399
Cdd:COG3209 846 TdalRAGTLTQTYTYDALGRLTSATDPGTTES-YTYDANGNLTSRTDGGTTTYT--YDALGRLVSVTKPDGTTTTYTYda 922
|
890 900 910 920
....*....|....*....|....*....|....*....|.
gi 2585425666 2400 ----DLHGNKTGETDANGHAT-SYQYDALHRVIATTRAGIQ 2435
Cdd:COG3209 923 lghtDHLGSVRALTDASGQVVwRYDYDPFGNLLAETSGAAA 963
|
|
| RhsA | COG3209 | Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ... | 1896-2812 | 7.94e-19 |
|
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 94.44 E-value: 7.94e-19
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1896 TTDVRGNVTHMRYDARGMRSAVVDALGGTTAVRHDNRGRQIAVVDAEGRASEMQYNTLDQLTRITLAAGTADAGQRTQTW 1975
Cdd:COG3209 1 ETSLGLVGGTTGASSTLLAATNAGGGTAVTNAGSTVLLAKGGLSTAAAAGGAATLTARSASTTDVVGTLTGAGGTSAGGV 80
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1976 DALGNKLSETDEEGRTTSYQYDAMGRVLRKSLPSGAIVTTYDLLGNKTSETNLRGDKTTYAYDDANRLVLRTEPATPPKT 2055
Cdd:COG3209 81 TALGDASAAGGGYVGGAAAGGGATLTGLAAATASAGRLVSTGAGAGGTVTAATGGTLGATAGSATTGSTDGGRGGVAVTG 160
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2056 TGYAYDGVGNITTETDALGRQTTHTYNHLNQRTATKFADGTTSTAVHDGNGNKTSETDALGRVTTYVHDALNRLLSQTIA 2135
Cdd:COG3209 161 LAGGGASAYGLTLGGAAAGPATGVGTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTG 240
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2136 GRSKRSMVYDASGNLLSRTDANGNTSAFAYDALNRVVAETDALGRVTHTDYDKVGNKLQVTNPLRQTQKWQYNARNWIVA 2215
Cdd:COG3209 241 SATGAAGAGAAVATAATTLGGTTGAGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGT 320
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2216 QQDGEGHQTRYGHDKVGNRVTETWPNGNIVNFEYDALNRLIRSEDSIGLLGTTAYDADGHITSQSDARGNATSFTWDAIG 2295
Cdd:COG3209 321 TGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSST 400
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2296 RQLSRSQPTAAGNAVTSTVYDAAGNIVSVTTPGGNVITTKYDSRNRPIEILDSLGIASATTYDAVGNPLTQTDGRGRVLT 2375
Cdd:COG3209 401 TGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAG 480
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2376 HQYNDFNLRTATSDGLGQVGTVEYDLHGNKTGETDANGHATSYQYDALHRVIATTRAGIQLQKTEYDEAGRIQFETDARG 2455
Cdd:COG3209 481 TGGGTLTSGSAGATTLGTDTTLDDTLGGTTTTTAGARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTST 560
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2456 NKVGYEYDkrgllVKTNRSLGAIDLLQRDSMGDVTLATDSEGRTTTTGYDKRRRAISVADGLGNTTHSTFDLAGNLTETK 2535
Cdd:COG3209 561 GTGGTGTV-----TTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATA 635
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2536 APNGATVSYAYDPANRLATITQSLDSGQAQATITYDTSGNLLEQRDLNGQSTRHAYDARNRRIRTTLPATqageAVQTNG 2615
Cdd:COG3209 636 STGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATTGATTGGTETG----TTVTTL 711
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2616 YDNADGLTEHTDANGNRFVHTLDIRGRRTQTVTTASQGNGPGSVLQTTFGYDANGNLTS--TAQTDSQGTRTETTTYDAF 2693
Cdd:COG3209 712 AGGTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLTSetTPGGVTQGTYTTRYTYDAL 791
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2694 NRPVKVTDAWGNSLTHSYDPQGNRIG-------------------------TTAATASAPGGSVTTIEYDALNRRTSQSG 2748
Cdd:COG3209 792 GRLTSVTYPDGETVTYTYDALGRLTSvitvgsgggtdlqdrtytydaagniTSITDALRAGTLTQTYTYDALGRLTSATD 871
|
890 900 910 920 930 940
....*....|....*....|....*....|....*....|....*....|....*....|....
gi 2585425666 2749 AGGTTRISYDKSGRVIQllHPDGSSTSTRYDKAGRVagetssTQATAGAGNTlldVAYTYDVNG 2812
Cdd:COG3209 872 PGTTESYTYDANGNLTS--RTDGGTTTYTYDALGRL------VSVTKPDGTT---TTYTYDALG 924
|
|
| RhsA | COG3209 | Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ... | 1427-2380 | 4.76e-18 |
|
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 92.13 E-value: 4.76e-18
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1427 NGYAFTLTAVSDKDGSVEPNQGLVISKLMLNDALPVGNILIQGVNVKSGRINLSGMGMGVAARGPQLALRPSYSSGGSGS 1506
Cdd:COG3209 14 SSTLLAATNAGGGTAVTNAGSTVLLAKGGLSTAAAAGGAATLTARSASTTDVVGTLTGAGGTSAGGVTALGDASAAGGGY 93
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1507 VGVLGVNWGHNFDASLSTTACGDILVNAGDGGFIRFLPQGNGTLTPAkGYHGTLIANNGDRSYDFYSKDGTRYHFGFIGG 1586
Cdd:COG3209 94 VGGAAAGGGATLTGLAAATASAGRLVSTGAGAGGTVTAATGGTLGAT-AGSATTGSTDGGRGGVAVTGLAGGGASAYGLT 172
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1587 KRQWALQSITDTNGNALTLTYDIGVDAPLLQVQNAYGQSLQFFYQTRAFVGSGAAVNVLQKVQGPEDMGLAFEYDAAGNL 1666
Cdd:COG3209 173 LGGAAAGPATGVGTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASVAATVTGSATGAAGAGAAV 252
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1667 VKITRTDAPEATESYSYSDHTGPLGMSNLLLSHTNALGQATQFKYHSGPVLRQFGNGQIPSFESTVIGVTAADGGHTAFA 1746
Cdd:COG3209 253 ATAATTLGGTTGAGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGTTTAAGTTGTAAVSGAADA 332
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1747 YNAKPEEPATDVTDARGKLTRYAFNKYGNPLSIAGPAGTTSMTWAVNDVLMLSKTDANGVVTSYTYDANGNQTSEQVSGS 1826
Cdd:COG3209 333 GTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGSGGGSSTTGVGAGTTTTST 412
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1827 GGTQAVSTSQTWLAQTAPPFIKNKRLSFTDRRGLTSTASFDARGNLTGEQMPDGSRISHSVAANGDRQSTTDVRGNVTHM 1906
Cdd:COG3209 413 TGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTGGGTEAGTGGGTLTSGSAG 492
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1907 RYDARGMRSAVVD-ALGGTTAVRHDNRGRQIAVVDAEGRASEMQYNTLDQLTRITLAAGTADAGQRTQTWDALGNKLSET 1985
Cdd:COG3209 493 ATTLGTDTTLDDTlGGTTTTTAGARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGTVGTGTSTGTGGTGTVTTTG 572
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1986 DEEGRTTSYQYDAMGRVLRKSLPSGAIVTTYDLLGNKTSETNLRGDKTTYAYDDANRLVLRTEPATPPKTTGYAYDGVGN 2065
Cdd:COG3209 573 DGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERATASTGSTTGGTTGTGVTTT 652
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2066 ITTETDALGRQTTHTYNHLNQRTATKFADGTTSTAVHDGNGNKTSETDALGRVTTYVHDALNRLLSQTIAGRSKRSMVYD 2145
Cdd:COG3209 653 GTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAGGTTTRLGTTTTGGGGGTTT 732
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2146 ASGNLLSRTDANGNTSAFAYDALNRVVAETDALGRVThtdydkvgnklQVTNPLRQTQkwqynarnwivaqqdgEGHQTR 2225
Cdd:COG3209 733 DGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLT-----------SETTPGGVTQ----------------GTYTTR 785
|
810 820 830 840 850 860 870 880
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2226 YGHDKVGNRVTETWPNGNIVNFEYDALNRLIRSEDSIGLLGTT------AYDADGHITSQSDARGNA---TSFTWDAIGR 2296
Cdd:COG3209 786 YTYDALGRLTSVTYPDGETVTYTYDALGRLTSVITVGSGGGTDlqdrtyTYDAAGNITSITDALRAGtltQTYTYDALGR 865
|
890 900 910 920 930 940 950 960
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2297 QLSRSQPTaagnAVTSTVYDAAGNIVSVTTPGGNVITtkYDSRNRPIEILDSLGIASATTYDA------VGNPLTQTDGR 2370
Cdd:COG3209 866 LTSATDPG----TTESYTYDANGNLTSRTDGGTTTYT--YDALGRLVSVTKPDGTTTTYTYDAlghtdhLGSVRALTDAS 939
|
970
....*....|
gi 2585425666 2371 GRVLTHQYND 2380
Cdd:COG3209 940 GQVVWRYDYD 949
|
|
| RhsA | COG3209 | Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction ... | 1388-2194 | 2.88e-17 |
|
Uncharacterized conserved protein RhaS, contains 28 RHS repeats [General function prediction only];
Pssm-ID: 442442 [Multi-domain] Cd Length: 1103 Bit Score: 89.43 E-value: 2.88e-17
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1388 SDDDNPVPTGSPVRLVDGQAFAMGANSITLAPDALLPARNGYAFTLTAVSDKDGSVEPNQGLVISKLMLNDALPVGNILI 1467
Cdd:COG3209 155 GVAVTGLAGGGASAYGLTLGGAAAGPATGVGTGAVTLATGLAGSALLALGSGAILGGLAGAYSGSATTATGTALGTPASV 234
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1468 QGVNVKSGRINLSGMGMGVAARGPQLALRPSYSSGGSGSVGVLGVNWGHNFDASLSTTACGDILVNAGDGGFIRFLPQGN 1547
Cdd:COG3209 235 AATVTGSATGAAGAGAAVATAATTLGGTTGAGTGASGAGLDASTGTGGAGGSNAAATAGGLGGAGLGSGGAGGGGTAGGT 314
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1548 GTLTPAKGYHGTLIANNGDRSYDFYSKDGTRYHFGFIGGKRQWALQSITDTNGNALTLTYDIGVDAPLLQVQNAYGQSLQ 1627
Cdd:COG3209 315 TTAAGTTGTAAVSGAADAGTTTTTGTGTGGTTTTVGGGGSLTLGGYGAAGGLTTSVGAGGGGSTSGSTTTVGGGGTATGS 394
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1628 FFYQTRAFVGSGAAVNVLQKVQGPEDMGLAFEYDAAGNLVKITRTDAPEATESYSYSDHTGPLGMSNLLLSHTNALGQAT 1707
Cdd:COG3209 395 GGGSSTTGVGAGTTTTSTTGGDGGPATAAGALTAGGTATGTGTGGGGTTAGTDATTTTGGAGASGTLTTTGGAATGATTG 474
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1708 QFKYHSGPVLRQFGNGQIPSFeSTVIGVTAADGGHTAFAYNAKPEEPATDVTDARGKLTRYAFNKYGNPLSIAGPAGTTS 1787
Cdd:COG3209 475 GGTEAGTGGGTLTSGSAGATT-LGTDTTLDDTLGGTTTTTAGARGLVVTTGTTLTLGTTTTATLSATDATGTGDTTTTGT 553
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1788 MTWAVNDVLMLSKTDANGVVTSYTYDANGNQTSEQVSGSGGTQAVSTSQTWLAQTAPPFIKNKRLSFTDRRGLTSTASFD 1867
Cdd:COG3209 554 VGTGTSTGTGGTGTVTTTGDGTGGASTTTGTTGGTATTTTVTTTTTTSTAGTTTTTTSGYTRAGLTLTLGTGTASGLERA 633
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1868 ARGNLTGEQMPDGSRISHSVAANGDRQSTTDVRGNVTHMRYDARGMRSAVVDALGGTTAVRHDNRGRQIAVVDAEGRASE 1947
Cdd:COG3209 634 TASTGSTTGGTTGTGVTTTGTTTTRATGTTGTGTGVTAGLTTLATGGTTVGGGTGTTSTATTGATTGGTETGTTVTTLAG 713
|
570 580 590 600 610 620 630 640
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1948 MQYNTLDQLTRITLAAGTADAGQRTQTWDALGNKLSETDEEGRTTSYQYDAMGRVLRKSLPSGAIV------TTYDLLGN 2021
Cdd:COG3209 714 GTTTRLGTTTTGGGGGTTTDGTGTGGTTGTLTTTSTTTTTTAGALTYTYDALGRLTSETTPGGVTQgtyttrYTYDALGR 793
|
650 660 670 680 690 700 710 720
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2022 KTSETNLRGDKTTYAYDDANRLV----LRTEPATPPKTTGYAYDGVGNITTETDALGR---QTTHTYNHLNQRTATKFAD 2094
Cdd:COG3209 794 LTSVTYPDGETVTYTYDALGRLTsvitVGSGGGTDLQDRTYTYDAAGNITSITDALRAgtlTQTYTYDALGRLTSATDPG 873
|
730 740 750 760 770 780 790 800
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 2095 GTTSTAvHDGNGNKTSETDalGRVTTYVHDALNRLLSQTIAGRSKRSMVYDASGnllsRTDANGNTSAfAYDALNRVV-- 2172
Cdd:COG3209 874 TTESYT-YDANGNLTSRTD--GGTTTYTYDALGRLVSVTKPDGTTTTYTYDALG----HTDHLGSVRA-LTDASGQVVwr 945
|
810 820
....*....|....*....|..
gi 2585425666 2173 AETDALGRVTHTDYDKVGNKLQ 2194
Cdd:COG3209 946 YDYDPFGNLLAETSGAAANPLR 967
|
|
| YebA | COG1305 | Transglutaminase-like enzyme, putative cysteine protease [Posttranslational modification, ... | 184-315 | 6.35e-14 |
|
Transglutaminase-like enzyme, putative cysteine protease [Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 440916 [Multi-domain] Cd Length: 174 Bit Score: 72.34 E-value: 6.35e-14
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 184 ATGQAQGNAPVLRAAMLPVRPLGLAVRAPVSAPVILPSYeAGQEIAAMPQDVADAPEAPLNEEIVAKAKEL------DYD 257
Cdd:COG1305 1 LAGLVLAALLAALSGPLAPAPTGLLVTAGAGRGGGVASV-VPGGGTELLAGPGELLSASYDPELRALAAELtggattPYE 79
|
90 100 110 120 130 140
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 2585425666 258 YVR-IYEYVRNGIRsewYS----GSTKGALGTLRTGAGNAVDQASLLVAMLRAAGAPARYVHG 315
Cdd:COG1305 80 KARaLYDWVRDNIR---YDpgstGVGTTALETLERRRGVCRDFAHLLVALLRALGIPARYVSG 139
|
|
| Transglut_core | pfam01841 | Transglutaminase-like superfamily; This family includes animal transglutaminases and other ... | 243-316 | 1.76e-12 |
|
Transglutaminase-like superfamily; This family includes animal transglutaminases and other bacterial proteins of unknown function. Sequence conservation in this superfamily primarily involves three motifs that centre around conserved cysteine, histidine, and aspartate residues that form the catalytic triad in the structurally characterized transglutaminase, the human blood clotting factor XIIIa'. On the basis of the experimentally demonstrated activity of the Methanobacterium phage pseudomurein endoisopeptidase, it is proposed that many, if not all, microbial homologs of the transglutaminases are proteases and that the eukaryotic transglutaminases have evolved from an ancestral protease.
Pssm-ID: 376628 [Multi-domain] Cd Length: 108 Bit Score: 65.89 E-value: 1.76e-12
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 243 LNEEIVAKAKElDYDYVR-IYEYVRNGIrseWYSGSTKG-----ALGTLRTGAGNAVDQASLLVAMLRAAGAPARYVHGV 316
Cdd:pfam01841 3 LADRITGGATD-PLEKARaIYDYVRKNI---TYDLPGRSpgdgdAEEFLFTGKGDCEDFASLFVALLRALGIPARYVTGY 78
|
|
| RHS_repeat | pfam05593 | RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... | 2144-2180 | 9.92e-08 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 50.29 E-value: 9.92e-08
10 20 30
....*....|....*....|....*....|....*..
gi 2585425666 2144 YDASGNLLSRTDANGNTSAFAYDALNRVVAETDALGR 2180
Cdd:pfam05593 1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
|
|
| YD_repeat_2x | TIGR01643 | YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ... | 2144-2185 | 2.26e-07 |
|
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Pssm-ID: 273728 [Multi-domain] Cd Length: 42 Bit Score: 49.13 E-value: 2.26e-07
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 2585425666 2144 YDASGNLLSRTDANGNTSAFAYDALNRVVAETDALGRVTHTD 2185
Cdd:TIGR01643 1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
|
|
| Big_1 | pfam02369 | Bacterial Ig-like domain (group 1); This family consists of bacterial domains with an Ig-like ... | 878-933 | 5.73e-06 |
|
Bacterial Ig-like domain (group 1); This family consists of bacterial domains with an Ig-like fold. Members of this family are found in bacterial surface proteins such as intimins and invasins involved in pathogenicity.
Pssm-ID: 460541 [Multi-domain] Cd Length: 64 Bit Score: 46.01 E-value: 5.73e-06
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|....*.
gi 2585425666 878 SVRVRDRDGRPVKGASVSFMAvrGGGSVSPATGTTNALGVASTTVTlgqSTLASSV 933
Cdd:pfam02369 10 TATVTDANGNPVPGATVTFSA--SGGTLSASSGTTDANGQATVTLT---STKAGTV 60
|
|
| TGc | smart00460 | Transglutaminase/protease-like homologues; Transglutaminases are enzymes that establish ... | 284-315 | 1.55e-05 |
|
Transglutaminase/protease-like homologues; Transglutaminases are enzymes that establish covalent links between proteins. A subset of transglutaminase homologues appear to catalyse the reverse reaction, the hydrolysis of peptide bonds. Proteins with this domain are both extracellular and intracellular, and it is likely that the eukaryotic intracellular proteins are involved in signalling events.
Pssm-ID: 214673 Cd Length: 68 Bit Score: 45.07 E-value: 1.55e-05
10 20 30
....*....|....*....|....*....|..
gi 2585425666 284 TLRTGAGNAVDQASLLVAMLRAAGAPARYVHG 315
Cdd:smart00460 1 LLKTKYGTCGEFAALFVALLRSLGIPARVVSG 32
|
|
| RHS_repeat | pfam05593 | RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... | 1975-2011 | 3.29e-05 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 42.97 E-value: 3.29e-05
10 20 30
....*....|....*....|....*....|....*..
gi 2585425666 1975 WDALGNKLSETDEEGRTTSYQYDAMGRVLRKSLPSGA 2011
Cdd:pfam05593 1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
|
|
| RHS_repeat | pfam05593 | RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... | 2399-2433 | 4.73e-05 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 42.59 E-value: 4.73e-05
10 20 30
....*....|....*....|....*....|....*
gi 2585425666 2399 YDLHGNKTGETDANGHATSYQYDALHRVIATTRAG 2433
Cdd:pfam05593 1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPD 35
|
|
| YD_repeat_2x | TIGR01643 | YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ... | 2399-2440 | 1.29e-04 |
|
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Pssm-ID: 273728 [Multi-domain] Cd Length: 42 Bit Score: 41.42 E-value: 1.29e-04
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 2585425666 2399 YDLHGNKTGETDANGHATSYQYDALHRVIATTRAGIQLQKTE 2440
Cdd:TIGR01643 1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
|
|
| RHS_repeat | pfam05593 | RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... | 2060-2096 | 1.42e-04 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 41.43 E-value: 1.42e-04
10 20 30
....*....|....*....|....*....|....*..
gi 2585425666 2060 YDGVGNITTETDALGRQTTHTYNHLNQRTATKFADGT 2096
Cdd:pfam05593 1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
|
|
| RHS_repeat | pfam05593 | RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... | 2666-2702 | 1.71e-04 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 41.05 E-value: 1.71e-04
10 20 30
....*....|....*....|....*....|....*..
gi 2585425666 2666 YDANGNLTStaQTDSQGTRTeTTTYDAFNRPVKVTDA 2702
Cdd:pfam05593 1 YDAAGRLTS--VTDPDGRVT-TYTYDAAGRLTAVTDP 34
|
|
| RHS_repeat | pfam05593 | RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... | 2315-2350 | 4.07e-04 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 39.89 E-value: 4.07e-04
10 20 30
....*....|....*....|....*....|....*.
gi 2585425666 2315 YDAAGNIVSVTTPGGNVITTKYDSRNRPIEILDSLG 2350
Cdd:pfam05593 1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDG 36
|
|
| RHS_repeat | pfam05593 | RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... | 2570-2604 | 6.02e-04 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 39.50 E-value: 6.02e-04
10 20 30
....*....|....*....|....*....|....*
gi 2585425666 2570 YDTSGNLLEQRDLNGQSTRHAYDARNRRIRTTLPA 2604
Cdd:pfam05593 1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPD 35
|
|
| YD_repeat_2x | TIGR01643 | YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ... | 2315-2356 | 6.34e-04 |
|
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Pssm-ID: 273728 [Multi-domain] Cd Length: 42 Bit Score: 39.50 E-value: 6.34e-04
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 2585425666 2315 YDAAGNIVSVTTPGGNVITTKYDSRNRPIEILDSLGIASATT 2356
Cdd:TIGR01643 1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
|
|
| RHS_repeat | pfam05593 | RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... | 2525-2556 | 7.26e-04 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 39.12 E-value: 7.26e-04
10 20 30
....*....|....*....|....*....|..
gi 2585425666 2525 FDLAGNLTETKAPNGATVSYAYDPANRLATIT 2556
Cdd:pfam05593 1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVT 32
|
|
| RHS_repeat |
pfam05593 |
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... |
2102-2133 |
9.01e-04 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 39.12 E-value: 9.01e-04
10 20 30
....*....|....*....|....*....|..
gi 2585425666 2102 HDGNGNKTSETDALGRVTTYVHDALNRLLSQT 2133
Cdd:pfam05593 1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVT 32
|
|
| RHS_repeat |
pfam05593 |
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... |
2270-2303 |
1.13e-03 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 38.73 E-value: 1.13e-03
10 20 30
....*....|....*....|....*....|....
gi 2585425666 2270 YDADGHITSQSDARGNATSFTWDAIGRQLSRSQP 2303
Cdd:pfam05593 1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDP 34
|
|
| RHS_repeat |
pfam05593 |
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... |
2165-2198 |
1.14e-03 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 38.73 E-value: 1.14e-03
10 20 30
....*....|....*....|....*....|....
gi 2585425666 2165 YDALNRVVAETDALGRVTHTDYDKVGNKLQVTNP 2198
Cdd:pfam05593 1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDP 34
|
|
| Bacuni_01323_like |
cd12871 |
Uncharacterized protein conserved in Bacteroidetes; A well-conserved family of 16-stranded ... |
1658-1779 |
2.17e-03 |
|
Uncharacterized protein conserved in Bacteroidetes; A well-conserved family of 16-stranded beta barrels resembling outer membrane porins. The interior of the barrels is mostly occupied by an insert with partially helical structure.
Pssm-ID: 214015 [Multi-domain] Cd Length: 231 Bit Score: 42.41 E-value: 2.17e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 1658 FEYDAAGNLVKITRTDAPEATE-SYSYSDhtgplgmsNLLLSHT---NALGQATQFKYHSGPVLRQFGN-GQIPSFESTV 1732
Cdd:cd12871 95 FTYNADGQLTKIVESIGTEYSTiTITWNN--------GDIVSIStksNTEENESKITYTSDKVYNPIVNkGCLMLFGLTL 166
|
90 100 110 120 130
....*....|....*....|....*....|....*....|....*....|....*.
gi 2585425666 1733 IgvtaADGGHTAFAYNAK---------PEEPATDVTDARGKLTrYAFNKYGNPLSI 1779
Cdd:cd12871 167 G----YDLSDLFYAYYAGllgkatkhlPESIIPKGNEETTTYT-YTFDKNGYPTSI 217
|
|
| PksD |
COG3321 |
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites ... |
134-635 |
2.56e-03 |
|
Acyl transferase domain in polyketide synthase (PKS) enzymes [Secondary metabolites biosynthesis, transport and catabolism];
Pssm-ID: 442550 [Multi-domain] Cd Length: 1386 Bit Score: 43.71 E-value: 2.56e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 134 SLPSSFEARRAAVQVQIDQLLQKLAAAMPGMDGDSQQKAQAVGALREALQATGQAQGNAPVLRAAMLPVRPLGLAVRAPV 213
Cdd:COG3321 862 PLPTYPFQREDAAAALLAAALAAALAAAAALGALLLAALAAALAAALLALAAAAAAALALAAAALAALLALVALAAAAAA 941
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 214 SAPVILPSYEAGQEIAAMPQDVADAPEAPLNEEIVAKAKELDYDYVRIYEYVRNGIRSEWYSGSTKGALGTLRTGAGNAV 293
Cdd:COG3321 942 LLALAAAAAAAAAALAAAEAGALLLLAAAAAAAAAAAAAAAAAAAAAAAAAAAALAAAAALALLAAAALLLAAAAAAAAL 1021
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 294 DQASLLVAMLRAAGAPARYVHGVAEIGVDGIASAAGLGDPGLVPEMLAKAGIAYSPVVQGGRVALVRMEHTWVAVQVPYT 373
Cdd:COG3321 1022 LALAALLAAAAAALAAAAAAAAAAAALAALAAAAAAAAALALALAALLLLAALAELALAAAALALAAALAAAALALALAA 1101
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 374 NYRGIVLDASGKTWLPLDVFHktLQPRPAGAGLADLGLDLQQLAMQYRSKVQSMDFGSFVREQVDAALQPKSSSYEAAAA 453
Cdd:COG3321 1102 LAAALLLLALLAALALAAAAA--ALLALAALLAAAAAAAALAAAAAAAAALALAAAAAALAAALAAALLAAAALLLALAL 1179
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 454 PMPIRAQALGLLPNTLAFTVVAATAESAALPDAVRSTARLRLFNDATGAGEAGLDISLPVHELFNQRATINYIPAELADH 533
Cdd:COG3321 1180 ALAAALAAALAGLAALLLAALLAALLAALLALALAALAAAAAALLAAAAAAAALALLALAAAAAAVAALAAAAAALLAAL 1259
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 2585425666 534 RAILLAGGL--DLAPLYLYQLRPELRLDGYQRKVGLAPLAGGSQVKFRLDIQNPANTQTVEQSFLVGAYHAIGVGQSGVA 611
Cdd:COG3321 1260 AALALLAAAagLAALAAAAAAAAAALALAAAAAAAAAALAALLAAAAAAAAAAAAAAAAAALAAALLAAALAALAAAVAA 1339
|
490 500
....*....|....*....|....
gi 2585425666 612 RSATPSARDGEYDAARLLDGIIQR 635
Cdd:COG3321 1340 ALALAAAAAAAAAAAAAAAAAAAL 1363
|
|
| YD_repeat_2x |
TIGR01643 |
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ... |
2037-2077 |
2.59e-03 |
|
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Pssm-ID: 273728 [Multi-domain] Cd Length: 42 Bit Score: 37.95 E-value: 2.59e-03
10 20 30 40
....*....|....*....|....*....|....*....|.
gi 2585425666 2037 YDDANRLVLRTEPATppKTTGYAYDGVGNITTETDALGRQT 2077
Cdd:TIGR01643 1 YDAAGRLTGSTDADG--TTTRYTYDAAGRLVEITDADGGST 39
|
|
| RHS_repeat |
pfam05593 |
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... |
2037-2075 |
2.63e-03 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 37.58 E-value: 2.63e-03
10 20 30
....*....|....*....|....*....|....*....
gi 2585425666 2037 YDDANRLVLRTEPATppKTTGYAYDGVGNITTETDALGR 2075
Cdd:pfam05593 1 YDAAGRLTSVTDPDG--RVTTYTYDAAGRLTAVTDPDGT 37
|
|
| YD_repeat_2x |
TIGR01643 |
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ... |
1950-1995 |
2.69e-03 |
|
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Pssm-ID: 273728 [Multi-domain] Cd Length: 42 Bit Score: 37.95 E-value: 2.69e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*.
gi 2585425666 1950 YNTLDQLTRITLAAGTadagQRTQTWDALGNKLSETDEEGRTTSYQ 1995
Cdd:TIGR01643 1 YDAAGRLTGSTDADGT----TTRYTYDAAGRLVEITDADGGSTRYE 42
|
|
| YD_repeat_2x |
TIGR01643 |
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ... |
2420-2461 |
4.03e-03 |
|
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Pssm-ID: 273728 [Multi-domain] Cd Length: 42 Bit Score: 37.18 E-value: 4.03e-03
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 2585425666 2420 YDALHRVIATTRAGIQLQKTEYDEAGRIQFETDARGNKVGYE 2461
Cdd:TIGR01643 1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
|
|
| RHS_repeat |
pfam05593 |
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... |
2357-2393 |
4.10e-03 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 37.19 E-value: 4.10e-03
10 20 30
....*....|....*....|....*....|....*..
gi 2585425666 2357 YDAVGNPLTQTDGRGRVLTHQYNDFNLRTATSDGLGQ 2393
Cdd:pfam05593 1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
|
|
| YD_repeat_2x |
TIGR01643 |
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ... |
2165-2206 |
4.20e-03 |
|
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Pssm-ID: 273728 [Multi-domain] Cd Length: 42 Bit Score: 37.18 E-value: 4.20e-03
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 2585425666 2165 YDALNRVVAETDALGRVTHTDYDKVGNKLQVTNPLRQTQKWQ 2206
Cdd:TIGR01643 1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
|
|
| YD_repeat_2x |
TIGR01643 |
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ... |
2123-2164 |
5.21e-03 |
|
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Pssm-ID: 273728 [Multi-domain] Cd Length: 42 Bit Score: 36.80 E-value: 5.21e-03
10 20 30 40
....*....|....*....|....*....|....*....|..
gi 2585425666 2123 HDALNRLLSQTIAGRSKRSMVYDASGNLLSRTDANGNTSAFA 2164
Cdd:TIGR01643 1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDADGGSTRYE 42
|
|
| RHS_repeat |
pfam05593 |
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... |
2690-2723 |
5.79e-03 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 36.81 E-value: 5.79e-03
10 20 30
....*....|....*....|....*....|....
gi 2585425666 2690 YDAFNRPVKVTDAWGNSLTHSYDPQGNRIGTTAA 2723
Cdd:pfam05593 1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDP 34
|
|
| YfaS |
COG2373 |
Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function ... |
876-941 |
6.29e-03 |
|
Uncharacterized conserved protein YfaS, alpha-2-macroglobulin family [General function prediction only];
Pssm-ID: 441940 [Multi-domain] Cd Length: 1605 Bit Score: 42.38 E-value: 6.29e-03
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 2585425666 876 SMSVRVRDR-DGRPVKGASVSFMAVRGggsVSPATGTTNALGVASTTVTLGQSTLASSVYVLVKPGD 941
Cdd:COG2373 277 GLLVFVTSLsTGKPVAGAEVELYDRNG---QVLATATTDADGLARFPAGDRGEGGRAPALLVARKGG 340
|
|
| YD_repeat_2x |
TIGR01643 |
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular ... |
1929-1975 |
8.45e-03 |
|
YD repeat (two copies); This model describes two tandem copies of a 21-residue extracellular repeat found in Gram-negative, Gram-positive, and animal proteins. The repeat is named for a YD dipeptide, the most strongly conserved motif of the repeat. These repeats appear in general to be involved in binding carbohydrate; the chicken teneurin-1 YD-repeat region has been shown to bind heparin.
Pssm-ID: 273728 [Multi-domain] Cd Length: 42 Bit Score: 36.41 E-value: 8.45e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*..
gi 2585425666 1929 HDNRGRQIAVVDAEGRASEMQYNTLDQLTRITLAagtadAGQRTQTW 1975
Cdd:TIGR01643 1 YDAAGRLTGSTDADGTTTRYTYDAAGRLVEITDA-----DGGSTRYE 42
|
|
| RHS_repeat |
pfam05593 |
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be ... |
1908-1944 |
9.86e-03 |
|
RHS Repeat; RHS proteins contain extended repeat regions. These repeats often appear to be involved in ligand binding. Note that this model may not find all the repeats in a protein and that it covers two RHS repeats. The 3D structure of an RHS-repeat-containing protein (the B and C components of an ABC toxin complex) has been determined. The RHS repeats form an extended strip of beta-sheet that spirals around to form a hollow shell, encapsulating the variable C-terminal domain.
Pssm-ID: 461685 [Multi-domain] Cd Length: 37 Bit Score: 36.04 E-value: 9.86e-03
10 20 30
....*....|....*....|....*....|....*..
gi 2585425666 1908 YDARGMRSAVVDALGGTTAVRHDNRGRQIAVVDAEGR 1944
Cdd:pfam05593 1 YDAAGRLTSVTDPDGRVTTYTYDAAGRLTAVTDPDGT 37
|
|
|
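Each hit above carries a statistics line of the form "Pssm-ID: ... [Multi-domain] Cd Length: ... Bit Score: ... E-value: ...". For readers who want to tabulate or filter these hits, the following is a minimal sketch of how such a line could be parsed. It assumes the field order and labels stay exactly as they appear in this listing; the names `STAT_RE` and `parse_stat_line` are hypothetical helpers introduced for illustration and are not part of any NCBI tool.

```python
import re

# Hypothetical pattern matching the statistics line as it appears in this
# listing; the bracketed hit type (e.g. "[Multi-domain]") is treated as optional.
STAT_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"(?:\[(?P<hit_type>[^\]]+)\]\s*)?"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>[\d.]+(?:e[+-]?\d+)?)",
    re.IGNORECASE,
)


def parse_stat_line(line):
    """Return the numeric fields of one statistics line, or None if no match."""
    match = STAT_RE.search(line)
    if match is None:
        return None
    return {
        "pssm_id": int(match["pssm_id"]),
        "hit_type": match["hit_type"],  # None when the bracketed tag is absent
        "cd_length": int(match["cd_length"]),
        "bit_score": float(match["bit_score"]),
        "e_value": float(match["e_value"]),
    }


# Example input copied from one of the YD_repeat_2x hits in this listing.
example = "Pssm-ID: 273728 [Multi-domain] Cd Length: 42 Bit Score: 39.50 E-value: 6.34e-04"
record = parse_stat_line(example)
```

A caller could then keep only hits below a chosen E-value cutoff, for example `record["e_value"] < 1e-3`; the cutoff itself is an analysis choice and is not implied by the listing.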