|
Name |
Accession |
Description |
Interval |
E-value |
| PLN03077 |
PLN03077 |
Protein ECB2; Provisional |
47-655 |
2.62e-164 |
|
Protein ECB2; Provisional
Pssm-ID: 215561 [Multi-domain] Cd Length: 857 Bit Score: 492.83 E-value: 2.62e-164
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 47 ISNNQLI-----QSLCKEGkLKQAIRVLSQESSPSQQTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGM 121
Cdd:PLN03077 254 ISWNAMIsgyfeNGECLEG-LELFFTMRELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQM 332
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 122 YSDLGSVDYARKVFDKTRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTYVLKACVasecTVNHLMKG 201
Cdd:PLN03077 333 YLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACA----CLGDLDVG 408
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 202 KEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVSWSAMIACYAKNGKAFEALRTFREMMRETKds 281
Cdd:PLN03077 409 VKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLTLK-- 486
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 282 sPNSVTMVSVLQACASLAALEQGKLIHGYILRRG--LDSILPviSALVTMYGRCGKLEVGQRVFDrMHDRDVVSWNSLIS 359
Cdd:PLN03077 487 -PNSVTLIAALSACARIGALMCGKEIHAHVLRTGigFDGFLP--NALLDLYVRCGRMNYAWNQFN-SHEKDVVSWNILLT 562
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 360 SYGVHGYGKKAIQIFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLLGRANRLD 439
Cdd:PLN03077 563 GYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKYSITPNLKHYACVVDLLGRAGKLT 642
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 440 EAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYVLLADIYAEAQMWDEVKRVKKLLEHRGL 519
Cdd:PLN03077 643 EAYNFINKMPITPDPAVWGALLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNLYADAGKWDEVARVRKTMRENGL 722
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 520 QKLPGRCWMEVRRKMYSFVSVDEFNPLMEQIHAFLVKLAEDMKEKGYIPQTKGVLYELETEEKErIVLGHSEKLALAFGL 599
Cdd:PLN03077 723 TVDPGCSWVEVKGKVHAFLTDDESHPQIKEINTVLEGFYEKMKASGLAGSESSSMDEIEVSKDD-IFCGHSERLAIAFGL 801
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|....*.
gi 219803397 600 INTSKGEPIRITKNLRLCEDCHLFTKFISKFMEKEILVRDVNRFHRFKNGVCSCGD 655
Cdd:PLN03077 802 INTVPGMPIWVTKNLYMCENCHNTVKFISKIVRREISVRDTEQFHHFKDGECSCGD 857
|
|
| DYW_deaminase |
pfam14432 |
DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases ... |
565-657 |
3.24e-44 |
|
DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases prototyped by the plant PPR DYW proteins that are implicated in chloroplast and mitochondrial RNA transcript maturation by numerous C to U editing events. The name derives from the DYW motif present at the C-terminus of the classical plant PPR DYW deaminases. Members containing this domain are present in bacteria, plants, Naegleria, and fungi. Plants and Naegleria show lineage-specific expansions of this family. This domain contains a characteriztic zinc-binding motif (CXXC, HXE) which has been shown to bind zinc ions. This domain is often fused to PPR repeats. Ascomycete versions, which are independent lateral transfers, contain a large insert within the domain and are often fused to ankyrin repeats. Bacterial versions are predicted to function as toxins in polymorphic toxin systems.
Pssm-ID: 464172 [Multi-domain] Cd Length: 93 Bit Score: 152.96 E-value: 3.24e-44
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 565 GYIPQTKGVLYELETEEKERIVLGHSEKLALAFGLINTSKGEPIRITKNLRLCEDCHLFTKFISKFMEKEILVRDVNRFH 644
Cdd:pfam14432 1 GYVPDLRFVLHDVDDEEKKQLLCGHSEKLALAYGLLTTPDGATIRITKNLRVCGDCHTAFKYISKIRGREIVVRDASRFH 80
|
90
....*....|...
gi 219803397 645 RFKNGVCSCGDYW 657
Cdd:pfam14432 81 HFKNGLCSCGDFW 93
|
|
| LapB |
COG2956 |
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ... |
369-516 |
1.05e-04 |
|
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442196 [Multi-domain] Cd Length: 275 Bit Score: 44.72 E-value: 1.05e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 369 KAIQIFEEMLAngASPTPVTFVSVLG-ACSHEGLVEEGKRLFETMWRdhgIKPQ-IEHYACMVDLLGRANRLDEAAKMVQ 446
Cdd:COG2956 60 RAIRIHQKLLE--RDPDRAEALLELAqDYLKAGLLDRAEELLEKLLE---LDPDdAEALRLLAEIYEQEGDWEKAIEVLE 134
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 219803397 447 DMRTE-PGPKVWGSLLGSC-RIHGNVELAERASRRLFALEPKNAGNYVLLADIYAEAQMWDEVKRV-KKLLEH 516
Cdd:COG2956 135 RLLKLgPENAHAYCELAELyLEQGDYDEAIEALEKALKLDPDCARALLLLAELYLEQGDYEEAIAAlERALEQ 207
|
|
| PPR |
TIGR00756 |
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ... |
352-385 |
1.77e-04 |
|
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Pssm-ID: 273253 [Multi-domain] Cd Length: 35 Bit Score: 38.98 E-value: 1.77e-04
10 20 30
....*....|....*....|....*....|....
gi 219803397 352 VSWNSLISSYGVHGYGKKAIQIFEEMLANGASPT 385
Cdd:TIGR00756 1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
|
|
|
|
Name |
Accession |
Description |
Interval |
E-value |
| PLN03077 |
PLN03077 |
Protein ECB2; Provisional |
47-655 |
2.62e-164 |
|
Protein ECB2; Provisional
Pssm-ID: 215561 [Multi-domain] Cd Length: 857 Bit Score: 492.83 E-value: 2.62e-164
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 47 ISNNQLI-----QSLCKEGkLKQAIRVLSQESSPSQQTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGM 121
Cdd:PLN03077 254 ISWNAMIsgyfeNGECLEG-LELFFTMRELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQM 332
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 122 YSDLGSVDYARKVFDKTRKRTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTYVLKACVasecTVNHLMKG 201
Cdd:PLN03077 333 YLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACA----CLGDLDVG 408
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 202 KEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVRNVVSWSAMIACYAKNGKAFEALRTFREMMRETKds 281
Cdd:PLN03077 409 VKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLTLK-- 486
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 282 sPNSVTMVSVLQACASLAALEQGKLIHGYILRRG--LDSILPviSALVTMYGRCGKLEVGQRVFDrMHDRDVVSWNSLIS 359
Cdd:PLN03077 487 -PNSVTLIAALSACARIGALMCGKEIHAHVLRTGigFDGFLP--NALLDLYVRCGRMNYAWNQFN-SHEKDVVSWNILLT 562
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 360 SYGVHGYGKKAIQIFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLLGRANRLD 439
Cdd:PLN03077 563 GYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKYSITPNLKHYACVVDLLGRAGKLT 642
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 440 EAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYVLLADIYAEAQMWDEVKRVKKLLEHRGL 519
Cdd:PLN03077 643 EAYNFINKMPITPDPAVWGALLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNLYADAGKWDEVARVRKTMRENGL 722
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 520 QKLPGRCWMEVRRKMYSFVSVDEFNPLMEQIHAFLVKLAEDMKEKGYIPQTKGVLYELETEEKErIVLGHSEKLALAFGL 599
Cdd:PLN03077 723 TVDPGCSWVEVKGKVHAFLTDDESHPQIKEINTVLEGFYEKMKASGLAGSESSSMDEIEVSKDD-IFCGHSERLAIAFGL 801
|
570 580 590 600 610
....*....|....*....|....*....|....*....|....*....|....*.
gi 219803397 600 INTSKGEPIRITKNLRLCEDCHLFTKFISKFMEKEILVRDVNRFHRFKNGVCSCGD 655
Cdd:PLN03077 802 INTVPGMPIWVTKNLYMCENCHNTVKFISKIVRREISVRDTEQFHHFKDGECSCGD 857
|
|
| PLN03081 |
PLN03081 |
pentatricopeptide (PPR) repeat-containing protein; Provisional |
78-657 |
8.89e-161 |
|
pentatricopeptide (PPR) repeat-containing protein; Provisional
Pssm-ID: 215563 [Multi-domain] Cd Length: 697 Bit Score: 478.21 E-value: 8.89e-161
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 78 QTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGMYSDLGSVDYARKVFDKTRKRTIYVWNALFRALTLAG 157
Cdd:PLN03081 124 STYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEMPERNLASWGTIIGGLVDAG 203
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 158 HGEEVLGLYWKMNRIGVESDRFTYTYVLKACVASectvNHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYAS 237
Cdd:PLN03081 204 NYREAFALFREMWEDGSDAEPRTFVVMLRASAGL----GSARAGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDAR 279
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 238 YVFGGMPVRNVVSWSAMIACYAKNGKAFEALRTFREMmretKDS--SPNSVTMVSVLQACASLAALEQGKLIHGYILRRG 315
Cdd:PLN03081 280 CVFDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEM----RDSgvSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTG 355
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 316 LDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYGKKAIQIFEEMLANGASPTPVTFVSVLGA 395
Cdd:PLN03081 356 FPLDIVANTALVDLYSKWGRMEDARNVFDRMPRKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSA 435
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 396 CSHEGLVEEGKRLFETMWRDHGIKPQIEHYACMVDLLGRANRLDEAAKMVQDMRTEPGPKVWGSLLGSCRIHGNVELAER 475
Cdd:PLN03081 436 CRYSGLSEQGWEIFQSMSENHRIKPRAMHYACMIELLGREGLLDEAYAMIRRAPFKPTVNMWAALLTACRIHKNLELGRL 515
|
410 420 430 440 450 460 470 480
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 476 ASRRLFALEPKNAGNYVLLADIYAEAQMWDEVKRVKKLLEHRGLQKLPGRCWMEVRRKMYSFVSVDEFNPLMEQIHAFLV 555
Cdd:PLN03081 516 AAEKLYGMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLKRKGLSMHPACTWIEVKKQDHSFFSGDRLHPQSREIYQKLD 595
|
490 500 510 520 530 540 550 560
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 556 KLAEDMKEKGYIPQTKGVLYELETEEKERIVLGHSEKLALAFGLINTSKGEPIRITKNLRLCEDCHLFTKFISKFMEKEI 635
Cdd:PLN03081 596 ELMKEISEYGYVAEENELLPDVDEDEEKVSGRYHSEKLAIAFGLINTSEWTPLQITQSHRICKDCHKVIKFIALVTKREI 675
|
570 580
....*....|....*....|..
gi 219803397 636 LVRDVNRFHRFKNGVCSCGDYW 657
Cdd:PLN03081 676 VVRDASRFHHFKLGKCSCGDYW 697
|
|
| DYW_deaminase |
pfam14432 |
DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases ... |
565-657 |
3.24e-44 |
|
DYW family of nucleic acid deaminases; This is the DYW domain found in nucleic acid deaminases prototyped by the plant PPR DYW proteins that are implicated in chloroplast and mitochondrial RNA transcript maturation by numerous C to U editing events. The name derives from the DYW motif present at the C-terminus of the classical plant PPR DYW deaminases. Members containing this domain are present in bacteria, plants, Naegleria, and fungi. Plants and Naegleria show lineage-specific expansions of this family. This domain contains a characteriztic zinc-binding motif (CXXC, HXE) which has been shown to bind zinc ions. This domain is often fused to PPR repeats. Ascomycete versions, which are independent lateral transfers, contain a large insert within the domain and are often fused to ankyrin repeats. Bacterial versions are predicted to function as toxins in polymorphic toxin systems.
Pssm-ID: 464172 [Multi-domain] Cd Length: 93 Bit Score: 152.96 E-value: 3.24e-44
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 565 GYIPQTKGVLYELETEEKERIVLGHSEKLALAFGLINTSKGEPIRITKNLRLCEDCHLFTKFISKFMEKEILVRDVNRFH 644
Cdd:pfam14432 1 GYVPDLRFVLHDVDDEEKKQLLCGHSEKLALAYGLLTTPDGATIRITKNLRVCGDCHTAFKYISKIRGREIVVRDASRFH 80
|
90
....*....|...
gi 219803397 645 RFKNGVCSCGDYW 657
Cdd:pfam14432 81 HFKNGLCSCGDFW 93
|
|
| PLN03077 |
PLN03077 |
Protein ECB2; Provisional |
147-450 |
3.87e-36 |
|
Protein ECB2; Provisional
Pssm-ID: 215561 [Multi-domain] Cd Length: 857 Bit Score: 145.38 E-value: 3.87e-36
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 147 NALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTYVLKAC----VASEctvnhlmkGKEIHAHLTRRGYSSHVYIMTT 222
Cdd:PLN03077 55 NSQLRALCSHGQLEQALKLLESMQELRVPVDEDAYVALFRLCewkrAVEE--------GSRVCSRALSSHPSLGVRLGNA 126
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 223 LVDMYARFGCVDYASYVFGGMPVRNVVSWSAMIACYAKNGKAFEALRTFREMMreTKDSSPNSVTMVSVLQACASLAALE 302
Cdd:PLN03077 127 MLSMFVRFGELVHAWYVFGKMPERDLFSWNVLVGGYAKAGYFDEALCLYHRML--WAGVRPDVYTFPCVLRTCGGIPDLA 204
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 303 QGKLIHGYILRRGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDVVSWNSLISSYGVHGYGKKAIQIFEEMLANGA 382
Cdd:PLN03077 205 RGREVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRRDCISWNAMISGYFENGECLEGLELFFTMRELSV 284
|
250 260 270 280 290 300
....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 219803397 383 SPTPVTFVSVLGACSHEGLVEEGKRLFETMWRdHGIKPQIEHYACMVDLLGRANRLDEAAKMVQDMRT 450
Cdd:PLN03077 285 DPDLMTITSVISACELLGDERLGREMHGYVVK-TGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMET 351
|
|
| E_motif |
pfam20431 |
E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) ... |
467-529 |
1.48e-17 |
|
E motif; This entry represents the E motif found in plant pentatricopeptide repeat (PPR) proteins which contain a DYW deaminase domain. The DYW domain is required for RNA editing, a process that deaminates specific cytidines to uridines. This motif, together with the E+ motif, precedes the DYW domain and, although their role is not clear, they are essential in the RNA editing reaction. The E/E+ motifs may contain two degenerate PPR motifs that could be involved in RNA or protein binding.
Pssm-ID: 466580 [Multi-domain] Cd Length: 63 Bit Score: 77.20 E-value: 1.48e-17
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|...
gi 219803397 467 HGNVELAERASRRLFALEPKNAGNYVLLADIYAEAQMWDEVKRVKKLLEHRGLQKLPGRCWME 529
Cdd:pfam20431 1 YSNVELAEKAANILLELEKTNDGNYTLLSNIYAYAGRWKDVERIRKLMKSSGIKKRPGCSWIE 63
|
|
| PLN03218 |
PLN03218 |
maturation of RBCL 1; Provisional |
57-450 |
5.60e-15 |
|
maturation of RBCL 1; Provisional
Pssm-ID: 215636 [Multi-domain] Cd Length: 1060 Bit Score: 78.77 E-value: 5.60e-15
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 57 CK-EGKLKQAIRVLSQESSPSQQTYELLILCCGHRSSLSDALRVHRHILDNGSDQDPFLATKLIGMYSDLGSVDyarkvf 135
Cdd:PLN03218 416 CKkQRAVKEAFRFAKLIRNPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVD------ 489
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 136 dktrkrtiyvwnALFraltlaghgeEVlglYWKMNRIGVESDRFTYTYVLKACvasectvnhlmkgkeihahltrrgyss 215
Cdd:PLN03218 490 ------------AMF----------EV---FHEMVNAGVEANVHTFGALIDGC--------------------------- 517
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 216 hvyimttlvdmyARFGCVDYASYVFGGMPVRNV----VSWSAMIACYAKNGKAFEALRTFREMMRETKDSSPNSVTMVSV 291
Cdd:PLN03218 518 ------------ARAGQVAKAFGAYGIMRSKNVkpdrVVFNALISACGQSGAVDRAFDVLAEMKAETHPIDPDHITVGAL 585
|
250 260 270 280 290 300 310 320
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 292 LQACASLA----ALEQGKLIHGYilrrGLDSILPVISALVTMYGRCGKLEVGQRVFDRMHDRDV----VSWNSLISSYGV 363
Cdd:PLN03218 586 MKACANAGqvdrAKEVYQMIHEY----NIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVkpdeVFFSALVDVAGH 661
|
330 340 350 360 370 380 390 400
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 364 HGYGKKAIQIFEEMLANGASPTPVTFVSVLGACSHEGLVEEGKRLFETMwRDHGIKPQIEHYACMVDLLGRANRLDEAAK 443
Cdd:PLN03218 662 AGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDI-KSIKLRPTVSTMNALITALCEGNQLPKALE 740
|
....*..
gi 219803397 444 MVQDMRT 450
Cdd:PLN03218 741 VLSEMKR 747
|
|
| PLN03218 |
PLN03218 |
maturation of RBCL 1; Provisional |
180-473 |
2.31e-13 |
|
maturation of RBCL 1; Provisional
Pssm-ID: 215636 [Multi-domain] Cd Length: 1060 Bit Score: 73.76 E-value: 2.31e-13
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 180 TYTYVLKACvaseCTVNHLMKGKEIHAHLTRRGYSSHVYIMTTLVDMYARFGCVDYASYVFGGMPVR----NVVSWSAMI 255
Cdd:PLN03218 439 TFNMLMSVC----ASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAgveaNVHTFGALI 514
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 256 ACYAKNG---KAFEALRTFRemmreTKDSSPNSVtmvsvlqacaslaaleqgklihgyilrrgldsilpVISALVTMYGR 332
Cdd:PLN03218 515 DGCARAGqvaKAFGAYGIMR-----SKNVKPDRV-----------------------------------VFNALISACGQ 554
|
170 180 190 200 210 220 230 240
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 333 CGKLevgQRVFDRMH---------DRDVVSWNSLISSYGVHGYGKKAIQIFEEMLANGASPTPVTFVSVLGACSHEGLVE 403
Cdd:PLN03218 555 SGAV---DRAFDVLAemkaethpiDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWD 631
|
250 260 270 280 290 300 310
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 219803397 404 EGKRLFETMWRDhGIKPQIEHYACMVDLLGRANRLDEAAKMVQDMRTE---PGPKVWGSLLGSCRIHGNVELA 473
Cdd:PLN03218 632 FALSIYDDMKKK-GVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQgikLGTVSYSSLMGACSNAKNWKKA 703
|
|
| PPR_2 |
pfam13041 |
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ... |
349-398 |
1.29e-09 |
|
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.
Pssm-ID: 463778 [Multi-domain] Cd Length: 50 Bit Score: 53.91 E-value: 1.29e-09
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|
gi 219803397 349 RDVVSWNSLISSYGVHGYGKKAIQIFEEMLANGASPTPVTFVSVLGACSH 398
Cdd:pfam13041 1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
|
|
| PPR_2 |
pfam13041 |
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ... |
246-296 |
1.19e-06 |
|
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.
Pssm-ID: 463778 [Multi-domain] Cd Length: 50 Bit Score: 45.82 E-value: 1.19e-06
10 20 30 40 50
....*....|....*....|....*....|....*....|....*....|.
gi 219803397 246 RNVVSWSAMIACYAKNGKAFEALRTFREMMRetKDSSPNSVTMVSVLQACA 296
Cdd:pfam13041 1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKK--RGVKPNVYTYTILINGLC 49
|
|
| PPR |
pfam01535 |
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ... |
352-381 |
6.92e-06 |
|
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.
Pssm-ID: 366695 [Multi-domain] Cd Length: 31 Bit Score: 42.84 E-value: 6.92e-06
10 20 30
....*....|....*....|....*....|
gi 219803397 352 VSWNSLISSYGVHGYGKKAIQIFEEMLANG 381
Cdd:pfam01535 1 VTYNSLISGYCKNGKLEEALELFKEMKEKG 30
|
|
| PPR_3 |
pfam13812 |
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ... |
341-396 |
9.48e-06 |
|
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.
Pssm-ID: 316342 [Multi-domain] Cd Length: 63 Bit Score: 43.50 E-value: 9.48e-06
10 20 30 40 50 60
....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 341 RVFDRMHDR----DVVSWNSLISSYGVHGYGKKAIQIFEEMLANGASPTPVTFVSVLGAC 396
Cdd:pfam13812 1 SILREMVRDgiqlNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
|
|
| PPR |
pfam01535 |
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ... |
249-277 |
2.84e-05 |
|
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.
Pssm-ID: 366695 [Multi-domain] Cd Length: 31 Bit Score: 41.30 E-value: 2.84e-05
10 20
....*....|....*....|....*....
gi 219803397 249 VSWSAMIACYAKNGKAFEALRTFREMMRE 277
Cdd:pfam01535 1 VTYNSLISGYCKNGKLEEALELFKEMKEK 29
|
|
| LapB |
COG2956 |
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ... |
369-516 |
1.05e-04 |
|
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442196 [Multi-domain] Cd Length: 275 Bit Score: 44.72 E-value: 1.05e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 369 KAIQIFEEMLAngASPTPVTFVSVLG-ACSHEGLVEEGKRLFETMWRdhgIKPQ-IEHYACMVDLLGRANRLDEAAKMVQ 446
Cdd:COG2956 60 RAIRIHQKLLE--RDPDRAEALLELAqDYLKAGLLDRAEELLEKLLE---LDPDdAEALRLLAEIYEQEGDWEKAIEVLE 134
|
90 100 110 120 130 140 150
....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 219803397 447 DMRTE-PGPKVWGSLLGSC-RIHGNVELAERASRRLFALEPKNAGNYVLLADIYAEAQMWDEVKRV-KKLLEH 516
Cdd:COG2956 135 RLLKLgPENAHAYCELAELyLEQGDYDEAIEALEKALKLDPDCARALLLLAELYLEQGDYEEAIAAlERALEQ 207
|
|
| BepA |
COG4783 |
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell ... |
399-515 |
1.35e-04 |
|
Outer membrane protein chaperone/metalloprotease BepA/YfgC, contains M48 and TPR domains [Cell wall/membrane/envelope biogenesis, Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443813 [Multi-domain] Cd Length: 139 Bit Score: 42.49 E-value: 1.35e-04
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 399 EGLVEEGKRLFETMWRDHgikPQ-IEHYACMVDLLGRANRLDEAAKMVQD-MRTEPGPKVWGSLLGSCRIH-GNVELAER 475
Cdd:COG4783 17 AGDYDEAEALLEKALELD---PDnPEAFALLGEILLQLGDLDEAIVLLHEaLELDPDEPEARLNLGLALLKaGDYDEALA 93
|
90 100 110 120
....*....|....*....|....*....|....*....|.
gi 219803397 476 ASRRLFALEPKNAGNYVLLADIYAEAQMWDE-VKRVKKLLE 515
Cdd:COG4783 94 LLEKALKLDPEHPEAYLRLARAYRALGRPDEaIAALEKALE 134
|
|
| PPR |
TIGR00756 |
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ... |
352-385 |
1.77e-04 |
|
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Pssm-ID: 273253 [Multi-domain] Cd Length: 35 Bit Score: 38.98 E-value: 1.77e-04
10 20 30
....*....|....*....|....*....|....
gi 219803397 352 VSWNSLISSYGVHGYGKKAIQIFEEMLANGASPT 385
Cdd:TIGR00756 1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
|
|
| PPR_2 |
pfam13041 |
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ... |
141-188 |
4.28e-04 |
|
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.
Pssm-ID: 463778 [Multi-domain] Cd Length: 50 Bit Score: 38.50 E-value: 4.28e-04
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 219803397 141 RTIYVWNALFRALTLAGHGEEVLGLYWKMNRIGVESDRFTYTYVLKAC 188
Cdd:pfam13041 1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
|
|
| PPR |
TIGR00756 |
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ... |
249-277 |
5.87e-04 |
|
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Pssm-ID: 273253 [Multi-domain] Cd Length: 35 Bit Score: 37.82 E-value: 5.87e-04
10 20
....*....|....*....|....*....
gi 219803397 249 VSWSAMIACYAKNGKAFEALRTFREMMRE 277
Cdd:TIGR00756 1 VTYNTLIDGLCKAGRVEEALELFKEMKER 29
|
|
| LapB |
COG2956 |
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ... |
325-516 |
1.83e-03 |
|
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];
Pssm-ID: 442196 [Multi-domain] Cd Length: 275 Bit Score: 40.87 E-value: 1.83e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 325 ALVTMYGRCGKLEVGQRVFDRMHDRD---VVSWNSLISSYGVHGYGKKAIQIFEEMLANGASpTPVTFVSVLGACSHEGL 401
Cdd:COG2956 47 ALGNLYRRRGEYDRAIRIHQKLLERDpdrAEALLELAQDYLKAGLLDRAEELLEKLLELDPD-DAEALRLLAEIYEQEGD 125
|
90 100 110 120 130 140 150 160
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 402 VEEGKRLFETMwrdHGIKPQIEHYAC-MVDLLGRANRLDEAAKMVQDMRTEpGPKVWGSLLGSCRIH---GNVELAERAS 477
Cdd:COG2956 126 WEKAIEVLERL---LKLGPENAHAYCeLAELYLEQGDYDEAIEALEKALKL-DPDCARALLLLAELYleqGDYEEAIAAL 201
|
170 180 190 200
....*....|....*....|....*....|....*....|
gi 219803397 478 RRLFALEPKNAGNYVLLADIYAEAQMWDE-VKRVKKLLEH 516
Cdd:COG2956 202 ERALEQDPDYLPALPRLAELYEKLGDPEEaLELLRKALEL 241
|
|
| PilF |
COG3063 |
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures]; |
468-515 |
3.75e-03 |
|
Type IV pilus assembly protein PilF/PilW [Cell motility, Extracellular structures];
Pssm-ID: 442297 [Multi-domain] Cd Length: 94 Bit Score: 37.07 E-value: 3.75e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*...
gi 219803397 468 GNVELAERASRRLFALEPKNAGNYVLLADIYAEAQMWDEVKRVKKLLE 515
Cdd:COG3063 6 GDLEEAEEYYEKALELDPDNADALNNLGLLLLEQGRYDEAIALEKALK 53
|
|
| NrfG |
COG4235 |
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, ... |
468-515 |
5.35e-03 |
|
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443378 [Multi-domain] Cd Length: 131 Bit Score: 37.68 E-value: 5.35e-03
10 20 30 40
....*....|....*....|....*....|....*....|....*....
gi 219803397 468 GNVELAERASRRLFALEPKNAGNYVLLADIYAEAQMWDEVKRV-KKLLE 515
Cdd:COG4235 31 GRYDEALAAYEKALRLDPDNADALLDLAEALLAAGDTEEAEELlERALA 79
|
|
| NrfG |
COG4235 |
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, ... |
431-515 |
5.62e-03 |
|
Cytochrome c-type biogenesis protein CcmH/NrfG [Energy production and conversion, Posttranslational modification, protein turnover, chaperones];
Pssm-ID: 443378 [Multi-domain] Cd Length: 131 Bit Score: 37.68 E-value: 5.62e-03
10 20 30 40 50 60 70 80
....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 219803397 431 LLGRA----NRLDEAAKMVQD-MRTEPG-PKVWGSLLGSCRIHGNVELAERASRRLFALEPKNAGNYVLLADIYAEAQMW 504
Cdd:COG4235 22 LLGRAylrlGRYDEALAAYEKaLRLDPDnADALLDLAEALLAAGDTEEAEELLERALALDPDNPEALYLLGLAAFQQGDY 101
|
90
....*....|..
gi 219803397 505 DE-VKRVKKLLE 515
Cdd:COG4235 102 AEaIAAWQKLLA 113
|
|
| PPR_1 |
pfam12854 |
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ... |
345-377 |
6.52e-03 |
|
PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.
Pssm-ID: 403914 [Multi-domain] Cd Length: 34 Bit Score: 34.63 E-value: 6.52e-03
10 20 30
....*....|....*....|....*....|...
gi 219803397 345 RMHDRDVVSWNSLISSYGVHGYGKKAIQIFEEM 377
Cdd:pfam12854 1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
|
|
|