NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|334183626|ref|NP_176639|]
View 

Pentatricopeptide repeat (PPR) superfamily protein [Arabidopsis thaliana]

Protein Classification

pentatricopeptide repeat-containing protein( domain architecture ID 13595575)

pentatricopeptide repeat (PPR)-containing protein may form anti-parallel alpha helices and bind single-stranded RNA in a sequence-specific and modular manner

CATH:  1.25.40.10
Gene Ontology:  GO:0003723
SCOP:  4001344

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03218 super family cl33664
maturation of RBCL 1; Provisional
176-503 8.37e-32

maturation of RBCL 1; Provisional


The actual alignment was detected with superfamily member PLN03218:

Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 130.38  E-value: 8.37e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  176 DGFGF-----VPNVVIYNTVINGLCKNRDLNNALEVFYCMEKKGIRADAVTYNTLISGLSNSGRwTDAA-RLLRDMVKRK 249
Cdd:PLN03218  424 EAFRFaklirNPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGK-VDAMfEVFHEMVNAG 502
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  250 IDPNVIFFTALIDTFVKEGNLLEARNLYKEMIRRSVVPNVFTYNSLINGfCihGCLGDAKYMFDLMV-----SKGCFPDV 324
Cdd:PLN03218  503 VEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISA-C--GQSGAVDRAFDVLAemkaeTHPIDPDH 579
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  325 VTYNTLITGFCKSKRVEDGMKLFCEMTYQGLVGDAFTYNTLIHGYCQAGKLNVAQKVFNRMVDCGVSPD----------- 393
Cdd:PLN03218  580 ITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDevffsalvdva 659
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  394 ------------------------IVTYNILLDCLCNNGKIEKALVMVEDLQKSEMDVDIITYNIIIQGLCRTDKLKEAW 449
Cdd:PLN03218  660 ghagdldkafeilqdarkqgiklgTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKAL 739
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....
gi 334183626  450 CLFRSLTRKGVKPDAIAYITMISGLCRKGLQREADKLCRRMKEDGFMPSERIYD 503
Cdd:PLN03218  740 EVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCR 793
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
147-196 6.33e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 51.60  E-value: 6.33e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 334183626  147 PSIVTLGSLLNGFCQGNRFQEAVSLVDSMDGFGFVPNVVIYNTVINGLCK 196
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 super family cl38385
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
77-126 6.41e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


The actual alignment was detected with superfamily member pfam13041:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 43.12  E-value: 6.41e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 334183626   77 PSIVDFTRVLTVIAKMNKFDIVIYLYHKMENLGISHDLYSFTILIHCFCR 126
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
176-503 8.37e-32

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 130.38  E-value: 8.37e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  176 DGFGF-----VPNVVIYNTVINGLCKNRDLNNALEVFYCMEKKGIRADAVTYNTLISGLSNSGRwTDAA-RLLRDMVKRK 249
Cdd:PLN03218  424 EAFRFaklirNPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGK-VDAMfEVFHEMVNAG 502
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  250 IDPNVIFFTALIDTFVKEGNLLEARNLYKEMIRRSVVPNVFTYNSLINGfCihGCLGDAKYMFDLMV-----SKGCFPDV 324
Cdd:PLN03218  503 VEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISA-C--GQSGAVDRAFDVLAemkaeTHPIDPDH 579
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  325 VTYNTLITGFCKSKRVEDGMKLFCEMTYQGLVGDAFTYNTLIHGYCQAGKLNVAQKVFNRMVDCGVSPD----------- 393
Cdd:PLN03218  580 ITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDevffsalvdva 659
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  394 ------------------------IVTYNILLDCLCNNGKIEKALVMVEDLQKSEMDVDIITYNIIIQGLCRTDKLKEAW 449
Cdd:PLN03218  660 ghagdldkafeilqdarkqgiklgTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKAL 739
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....
gi 334183626  450 CLFRSLTRKGVKPDAIAYITMISGLCRKGLQREADKLCRRMKEDGFMPSERIYD 503
Cdd:PLN03218  740 EVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCR 793
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
358-406 5.18e-17

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 74.71  E-value: 5.18e-17
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 334183626  358 DAFTYNTLIHGYCQAGKLNVAQKVFNRMVDCGVSPDIVTYNILLDCLCN 406
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
147-196 6.33e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 51.60  E-value: 6.33e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 334183626  147 PSIVTLGSLLNGFCQGNRFQEAVSLVDSMDGFGFVPNVVIYNTVINGLCK 196
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
290-324 1.77e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 47.45  E-value: 1.77e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 334183626  290 FTYNSLINGFCIHGCLGDAKYMFDLMVSKGCFPDV 324
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
77-126 6.41e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 43.12  E-value: 6.41e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 334183626   77 PSIVDFTRVLTVIAKMNKFDIVIYLYHKMENLGISHDLYSFTILIHCFCR 126
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
150-184 8.36e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.05  E-value: 8.36e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 334183626  150 VTLGSLLNGFCQGNRFQEAVSLVDSMDGFGFVPNV 184
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
176-503 8.37e-32

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 130.38  E-value: 8.37e-32
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  176 DGFGF-----VPNVVIYNTVINGLCKNRDLNNALEVFYCMEKKGIRADAVTYNTLISGLSNSGRwTDAA-RLLRDMVKRK 249
Cdd:PLN03218  424 EAFRFaklirNPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGK-VDAMfEVFHEMVNAG 502
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  250 IDPNVIFFTALIDTFVKEGNLLEARNLYKEMIRRSVVPNVFTYNSLINGfCihGCLGDAKYMFDLMV-----SKGCFPDV 324
Cdd:PLN03218  503 VEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISA-C--GQSGAVDRAFDVLAemkaeTHPIDPDH 579
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  325 VTYNTLITGFCKSKRVEDGMKLFCEMTYQGLVGDAFTYNTLIHGYCQAGKLNVAQKVFNRMVDCGVSPD----------- 393
Cdd:PLN03218  580 ITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDevffsalvdva 659
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  394 ------------------------IVTYNILLDCLCNNGKIEKALVMVEDLQKSEMDVDIITYNIIIQGLCRTDKLKEAW 449
Cdd:PLN03218  660 ghagdldkafeilqdarkqgiklgTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKAL 739
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|....
gi 334183626  450 CLFRSLTRKGVKPDAIAYITMISGLCRKGLQREADKLCRRMKEDGFMPSERIYD 503
Cdd:PLN03218  740 EVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCR 793
PLN03218 PLN03218
maturation of RBCL 1; Provisional
175-481 1.69e-25

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 111.12  E-value: 1.69e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  175 MDGFGFVPNVVIYNTVINGLCKNRDLNNALEVFYCMEKKGIRADAVTYNTLISGLSNSGRWTDAARLLRDMV--KRKIDP 252
Cdd:PLN03218  498 MVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKaeTHPIDP 577
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  253 NVIFFTALIDTFVKEGNLLEARNLYKeMIRRSVV---PNVFTynslingFCIHGC--LGDAKY---MFDLMVSKGCFPDV 324
Cdd:PLN03218  578 DHITVGALMKACANAGQVDRAKEVYQ-MIHEYNIkgtPEVYT-------IAVNSCsqKGDWDFalsIYDDMKKKGVKPDE 649
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  325 VTYNTLITGFCKSKRVEDGMKLFCEMTYQGLVGDAFTYNTLIHGYCQAGKLNVAQKVFNRMVDCGVSPDIVTYNILLDCL 404
Cdd:PLN03218  650 VFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITAL 729
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 334183626  405 CNNGKIEKALVMVEDLQKSEMDVDIITYNIIIQGLCRTDKLKEAWCLFRSLTRKGVKPDAIAYiTMISGLCrkgLQR 481
Cdd:PLN03218  730 CEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMC-RCITGLC---LRR 802
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
138-489 9.46e-24

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 105.34  E-value: 9.46e-24
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 138 GKMMKLGFRPSIVTLGSLLNGFCQGNRFQEAVSLVDSMdgfgfvP--NVVIYNTVINGLCKNRDLNNALEVFYCMEKKGI 215
Cdd:PLN03081 147 WHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEM------PerNLASWGTIIGGLVDAGNYREAFALFREMWEDGS 220
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 216 RADAVTYNTLISGLSNSGRWTDAARLLRDMVKRKIDPNVIFFTALIDTFVKEGNLLEARNLYKEMIRRSVVpnvfTYNSL 295
Cdd:PLN03081 221 DAEPRTFVVMLRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMPEKTTV----AWNSM 296
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 296 INGFCIHGCLGDAKYMFDLMVSKGCFPDVVTYNTLITGFCKSKRVEDGMKLFCEMTYQGLVGDAFTYNTLIHGYCQAGKL 375
Cdd:PLN03081 297 LAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRM 376
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 376 NVAQKVFNRMvdcgvsP--DIVTYNILLDCLCNNGKIEKALVMVEDLQKSEMDVDIITYNIIIQGlCRTDKLKE-AWCLF 452
Cdd:PLN03081 377 EDARNVFDRM------PrkNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSA-CRYSGLSEqGWEIF 449
                        330       340       350
                 ....*....|....*....|....*....|....*...
gi 334183626 453 RSLTR-KGVKPDAIAYITMISGLCRKGLQREADKLCRR 489
Cdd:PLN03081 450 QSMSEnHRIKPRAMHYACMIELLGREGLLDEAYAMIRR 487
PLN03218 PLN03218
maturation of RBCL 1; Provisional
82-440 7.38e-23

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 103.03  E-value: 7.38e-23
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626   82 FTRVLTVIAKMNKFDIVIYLYHKMENLGISHDLYSFTILIHCFCRCSRLSLALALLGKMMKLGFRPSIVTLGSLLNGFCQ 161
Cdd:PLN03218  475 YTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQ 554
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  162 GNRFQEAVSLVDSM--DGFGFVPNVVIYNTVINGLCKNRDLNNALEVFYCMEKKGIRADAVTYNTLISGLSNSGRWTDAA 239
Cdd:PLN03218  555 SGAVDRAFDVLAEMkaETHPIDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFAL 634
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  240 RLLRDMVKRKIDPNVIFFTALIDTFVKEGNLLEArnlykemirrsvvpnvftynslingFCIhgcLGDAKymfdlmvSKG 319
Cdd:PLN03218  635 SIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKA-------------------------FEI---LQDAR-------KQG 679
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  320 CFPDVVTYNTLITGFCKSKRVEDGMKLFCEMTYQGLVGDAFTYNTLIHGYCQAGKLNVAQKVFNRMVDCGVSPDIVTYNI 399
Cdd:PLN03218  680 IKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSI 759
                         330       340       350       360
                  ....*....|....*....|....*....|....*....|.
gi 334183626  400 LLDCLCNNGKIEKALVMVEDLQKSEMDVDIITYNIIIqGLC 440
Cdd:PLN03218  760 LLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT-GLC 799
PLN03077 PLN03077
Protein ECB2; Provisional
100-498 5.39e-22

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 99.92  E-value: 5.39e-22
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 100 YLYHKMEnlgiSHDLYSFTILIHCFCRCSRLSLALALLGKMMKLGFRPSIVTLGSLLNGfCQG----NRFQEAVSLVDSm 175
Cdd:PLN03077 142 YVFGKMP----ERDLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRT-CGGipdlARGREVHAHVVR- 215
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 176 dgFGFVPNVVIYNTVINGLCKNRDLNNALEVFYCMEkkgiRADAVTYNTLISGLSNSGRWTDAARLLRDMVKRKIDPNVI 255
Cdd:PLN03077 216 --FGFELDVDVVNALITMYVKCGDVVSARLVFDRMP----RRDCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLM 289
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 256 FFTALIDTFVKEGNLLEARNLYKEMIRRSVVPNVFTYNSLINGFCIHGCLGDAKYMFDLMVSKgcfpDVVTYNTLITGFC 335
Cdd:PLN03077 290 TITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETK----DAVSWTAMISGYE 365
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 336 KSKRVEDGMKLFCEMTYQGLVGDAFTYNTLIHGYCQAGKLNVAQKVFNRMVDCGVSPDIVTYNILLDCLCNNGKIEKALv 415
Cdd:PLN03077 366 KNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKAL- 444
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 416 mveDLQKSEMDVDIITYNIIIQGLCRTDKLKEAWCLFRSLTRKgVKPDAIAYITMISGLCRKGLQR-----EADKLCRRM 490
Cdd:PLN03077 445 ---EVFHNIPEKDVISWTSIIAGLRLNNRCFEALIFFRQMLLT-LKPNSVTLIAALSACARIGALMcgkeiHAHVLRTGI 520

                 ....*...
gi 334183626 491 KEDGFMPS 498
Cdd:PLN03077 521 GFDGFLPN 528
PLN03077 PLN03077
Protein ECB2; Provisional
138-490 1.01e-21

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 99.15  E-value: 1.01e-21
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 138 GKMMKLGFRPSIVTLGSLLNGFCQGNRFQEAVSLVDSMDgfgfVPNVVIYNTVINGLCKNRDLNNALEVFYCMEKKGIRA 217
Cdd:PLN03077 312 GYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRME----TKDAVSWTAMISGYEKNGLPDKALETYALMEQDNVSP 387
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 218 DAVTYNTLISGLSNSGRWTDAARLLRDMVKRKIDPNVIFFTALIDTFVKEGNLLEARNLYKEMIRRsvvpNVFTYNSLIN 297
Cdd:PLN03077 388 DEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEK----DVISWTSIIA 463
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 298 GFCIHGCLGDAKYMFDLMVSKgCFPDVVTYNTLITGFCKSKRVEDGMKLFCEMTYQGLVGDAFTYNTLIHGYCQAGKLNV 377
Cdd:PLN03077 464 GLRLNNRCFEALIFFRQMLLT-LKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGFDGFLPNALLDLYVRCGRMNY 542
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 378 AQKVFNRMvdcgvSPDIVTYNILLDCLCNNGKIEKALVMVEDLQKSEMDVDIITYNIIIQGLCRTDKLKEAWCLFRSLTR 457
Cdd:PLN03077 543 AWNQFNSH-----EKDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEE 617
                        330       340       350
                 ....*....|....*....|....*....|....
gi 334183626 458 K-GVKPDAIAYITMISGLCRKGLQREADKLCRRM 490
Cdd:PLN03077 618 KySITPNLKHYACVVDLLGRAGKLTEAYNFINKM 651
PLN03077 PLN03077
Protein ECB2; Provisional
139-402 3.35e-19

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 91.45  E-value: 3.35e-19
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 139 KMMKL-GFRPSIVTLGSLLNGFCQGNRFQEAVSLVDSMDGFGFVPNVVIYNTVINGLCKNRDLNNALEVFYCMEKKgira 217
Cdd:PLN03077 378 ALMEQdNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEK---- 453
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 218 DAVTYNTLISGLSNSGRWTDAARLLRDMvKRKIDPNVIFFTALIDTFVKEGNLLEARNLYKEMIRRSV-----VPN---- 288
Cdd:PLN03077 454 DVISWTSIIAGLRLNNRCFEALIFFRQM-LLTLKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIgfdgfLPNalld 532
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 289 ---------------------VFTYNSLINGFCIHGCLGDAKYMFDLMVSKGCFPDVVTYNTLITGFCKSKRVEDGMKLF 347
Cdd:PLN03077 533 lyvrcgrmnyawnqfnshekdVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYF 612
                        250       260       270       280       290
                 ....*....|....*....|....*....|....*....|....*....|....*.
gi 334183626 348 CEMTYQ-GLVGDAFTYNTLIHGYCQAGKLNVAQKVFNRMvdcGVSPDIVTYNILLD 402
Cdd:PLN03077 613 HSMEEKySITPNLKHYACVVDLLGRAGKLTEAYNFINKM---PITPDPAVWGALLN 665
PLN03077 PLN03077
Protein ECB2; Provisional
90-493 6.61e-19

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 90.29  E-value: 6.61e-19
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  90 AKMNKFDIVIYLYHKMENLGISHDLYSFTILIHCFCRCSRLSLALALLGKMMKLGFRPSIVTLGSLLNGFCQGNRFQEAV 169
Cdd:PLN03077 163 AKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHVVRFGFELDVDVVNALITMYVKCGDVVSAR 242
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 170 SLVDSMDgfgfVPNVVIYNTVINGLCKNRDLNNALEVFYCME-----------------------------------KKG 214
Cdd:PLN03077 243 LVFDRMP----RRDCISWNAMISGYFENGECLEGLELFFTMRelsvdpdlmtitsvisacellgderlgremhgyvvKTG 318
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 215 IRADAVTYNTLISGLSNSGRWTDAARLLRDMVKRkidpNVIFFTALIDTFVKEGNLLEARNLYKEMIRRSVVPNVFTYNS 294
Cdd:PLN03077 319 FAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETK----DAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIAS 394
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 295 LINGFCIHGCLGDAKYMFDLMVSKGCFPDVVTYNTLITGFCKSKRVEDGMKLFCEMTYQglvgDAFTYNTLIHGYC---- 370
Cdd:PLN03077 395 VLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEK----DVISWTSIIAGLRlnnr 470
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 371 ------------------------------QAGKLNVAQKVFNRMVDCGVSPDIVTYNILLDCLCNNGKIEKALVmvedl 420
Cdd:PLN03077 471 cfealiffrqmlltlkpnsvtliaalsacaRIGALMCGKEIHAHVLRTGIGFDGFLPNALLDLYVRCGRMNYAWN----- 545
                        410       420       430       440       450       460       470
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 334183626 421 QKSEMDVDIITYNIIIQGLCRTDKLKEAWCLFRSLTRKGVKPDAIAYITMISGLCRKGLQREADKLCRRMKED 493
Cdd:PLN03077 546 QFNSHEKDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEK 618
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
358-406 5.18e-17

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 74.71  E-value: 5.18e-17
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 334183626  358 DAFTYNTLIHGYCQAGKLNVAQKVFNRMVDCGVSPDIVTYNILLDCLCN 406
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
322-371 3.80e-16

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 72.40  E-value: 3.80e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 334183626  322 PDVVTYNTLITGFCKSKRVEDGMKLFCEMTYQGLVGDAFTYNTLIHGYCQ 371
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
287-336 4.52e-16

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 72.01  E-value: 4.52e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 334183626  287 PNVFTYNSLINGFCIHGCLGDAKYMFDLMVSKGCFPDVVTYNTLITGFCK 336
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PLN03077 PLN03077
Protein ECB2; Provisional
154-499 4.64e-16

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 81.43  E-value: 4.64e-16
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 154 SLLNGFCQGNRFQEAVSLVDSMDGFGFVPNVVIYNTVINgLCKN-RDLNNALEVFYCMEKKGIRADAVTYNTLISGLSNS 232
Cdd:PLN03077  56 SQLRALCSHGQLEQALKLLESMQELRVPVDEDAYVALFR-LCEWkRAVEEGSRVCSRALSSHPSLGVRLGNAMLSMFVRF 134
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 233 GRWTDAARLLRDMVKRkidpNVIFFTALIDTFVKEGNLLEARNLYKEMIRRSVVPNVFTYNS----------LINGFCIH 302
Cdd:PLN03077 135 GELVHAWYVFGKMPER----DLFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCvlrtcggipdLARGREVH 210
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 303 GCLgdAKYMFDLMVskgcfpDVVtyNTLITGFCKSKRVEDGMKLFCEMTyqglVGDAFTYNTLIHGYCQAGKLNVAQKVF 382
Cdd:PLN03077 211 AHV--VRFGFELDV------DVV--NALITMYVKCGDVVSARLVFDRMP----RRDCISWNAMISGYFENGECLEGLELF 276
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 383 NRMVDCGVSPDIVTYNILLDCLCNNGKIEKALVMVEDLQKSEMDVDIITYNIIIQGLCRTDKLKEAWCLFRSLTRKgvkp 462
Cdd:PLN03077 277 FTMRELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETK---- 352
                        330       340       350
                 ....*....|....*....|....*....|....*..
gi 334183626 463 DAIAYITMISGLCRKGLQREADKLCRRMKEDGFMPSE 499
Cdd:PLN03077 353 DAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDE 389
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
182-231 3.13e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 67.00  E-value: 3.13e-14
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 334183626  182 PNVVIYNTVINGLCKNRDLNNALEVFYCMEKKGIRADAVTYNTLISGLSN 231
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
428-476 4.15e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 66.62  E-value: 4.15e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 334183626  428 DIITYNIIIQGLCRTDKLKEAWCLFRSLTRKGVKPDAIAYITMISGLCR 476
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
252-300 5.30e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 66.23  E-value: 5.30e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 334183626  252 PNVIFFTALIDTFVKEGNLLEARNLYKEMIRRSVVPNVFTYNSLINGFC 300
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
392-441 1.66e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 64.69  E-value: 1.66e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 334183626  392 PDIVTYNILLDCLCNNGKIEKALVMVEDLQKSEMDVDIITYNIIIQGLCR 441
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
318-351 9.19e-12

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 59.28  E-value: 9.19e-12
                          10        20        30
                  ....*....|....*....|....*....|....
gi 334183626  318 KGCFPDVVTYNTLITGFCKSKRVEDGMKLFCEMT 351
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
57-355 4.82e-11

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 65.28  E-value: 4.82e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  57 CIKFDDAFSLFCEMlqsrPIPSIVDFTRVLTVIAKMNKFDIVIYLYHKMENLGISHDLYSFTILIHCFCRCSRLSLALAL 136
Cdd:PLN03081 272 CGDIEDARCVFDGM----PEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQA 347
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 137 LGKMMKLGFRPSIVTLGSLLNGFCQGNRFQEAVSLVDSMDgfgfVPNVVIYNTVINGLCKNRDLNNALEVFYCMEKKGIR 216
Cdd:PLN03081 348 HAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMP----RKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVA 423
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 217 ADAVTYNTLISGLSNSGRWTDAARLLRDMVK-RKIDPNVIFFTALIDTFVKEGNLLEARNlykeMIRRS-VVPNVFTYNS 294
Cdd:PLN03081 424 PNHVTFLAVLSACRYSGLSEQGWEIFQSMSEnHRIKPRAMHYACMIELLGREGLLDEAYA----MIRRApFKPTVNMWAA 499
                        250       260       270       280       290       300
                 ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 334183626 295 LINGFCIHGCLGDAKYMFDLMVSKGcfPDVV-TYNTLITGFCKSKRVEDGMKLFCEMTYQGL 355
Cdd:PLN03081 500 LLTACRIHKNLELGRLAAEKLYGMG--PEKLnNYVVLLNLYNSSGRQAEAAKVVETLKRKGL 559
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
218-266 6.63e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 57.37  E-value: 6.63e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 334183626  218 DAVTYNTLISGLSNSGRWTDAARLLRDMVKRKIDPNVIFFTALIDTFVK 266
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
354-386 3.28e-10

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 55.04  E-value: 3.28e-10
                          10        20        30
                  ....*....|....*....|....*....|...
gi 334183626  354 GLVGDAFTYNTLIHGYCQAGKLNVAQKVFNRMV 386
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PLN03218 PLN03218
maturation of RBCL 1; Provisional
61-301 3.61e-09

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 59.51  E-value: 3.61e-09
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626   61 DDAFSLFCEM-LQSRPI-PSIVDFTRVLTVIAKMNKFDIVIYLYHKMENLGISHDLYSFTILIHCFCRCSRLSLALALLG 138
Cdd:PLN03218  559 DRAFDVLAEMkAETHPIdPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYD 638
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  139 KMMKLGFRPSIVTLGSLLNGFCQGNRFQEAVSLVDSMDGFGFVPNVVIYNTVINGLCKNRDLNNALEVFYCMEKKGIRAD 218
Cdd:PLN03218  639 DMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPT 718
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  219 AVTYNTLISGLSNSGRWTDAARLLRDMVKRKIDPNVIFFTALIDTFVKEGNLLEARNLYKEMIRRSVVPNVFTYNSLInG 298
Cdd:PLN03218  719 VSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPNLVMCRCIT-G 797

                  ...
gi 334183626  299 FCI 301
Cdd:PLN03218  798 LCL 800
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
147-196 6.33e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 51.60  E-value: 6.33e-09
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 334183626  147 PSIVTLGSLLNGFCQGNRFQEAVSLVDSMDGFGFVPNVVIYNTVINGLCK 196
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
345-407 1.51e-08

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 51.20  E-value: 1.51e-08
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 334183626  345 KLFCEMTYQGLVGDAFTYNTLIHGYCQAGKLNVAQKVFNRMVDCGVSPDIVTYNILLDCLCNN 407
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
178-211 2.81e-08

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 49.65  E-value: 2.81e-08
                          10        20        30
                  ....*....|....*....|....*....|....
gi 334183626  178 FGFVPNVVIYNTVINGLCKNRDLNNALEVFYCME 211
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
283-316 1.22e-07

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 47.73  E-value: 1.22e-07
                          10        20        30
                  ....*....|....*....|....*....|....
gi 334183626  283 RSVVPNVFTYNSLINGFCIHGCLGDAKYMFDLMV 316
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
389-419 1.48e-07

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 47.34  E-value: 1.48e-07
                          10        20        30
                  ....*....|....*....|....*....|.
gi 334183626  389 GVSPDIVTYNILLDCLCNNGKIEKALVMVED 419
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDE 32
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
290-324 1.77e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 47.45  E-value: 1.77e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 334183626  290 FTYNSLINGFCIHGCLGDAKYMFDLMVSKGCFPDV 324
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PLN03077 PLN03077
Protein ECB2; Provisional
293-497 1.81e-07

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 54.09  E-value: 1.81e-07
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 293 NSLINGFCIHGCLGDAKYMFDLMVSKGCFPDVVTYNTLITgFCKSKR-VEDGMKLFC----EMTYQGL------------ 355
Cdd:PLN03077  55 NSQLRALCSHGQLEQALKLLESMQELRVPVDEDAYVALFR-LCEWKRaVEEGSRVCSralsSHPSLGVrlgnamlsmfvr 133
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 356 ---VGDA------------FTYNTLIHGYCQAGKLNVAQKVFNRMVDCGVSPDIVTYNILLDClCNN------GKIEKAL 414
Cdd:PLN03077 134 fgeLVHAwyvfgkmperdlFSWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRT-CGGipdlarGREVHAH 212
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 415 VMVEDLqksEMDVDIItyNIIIQGLCRTDKLKEAWCLFRSLTRKgvkpDAIAYITMISGLCRKGLQREADKLCRRMKEDG 494
Cdd:PLN03077 213 VVRFGF---ELDVDVV--NALITMYVKCGDVVSARLVFDRMPRR----DCISWNAMISGYFENGECLEGLELFFTMRELS 283

                 ...
gi 334183626 495 FMP 497
Cdd:PLN03077 284 VDP 286
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
276-331 2.19e-07

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 47.74  E-value: 2.19e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*.
gi 334183626  276 LYKEMIRRSVVPNVFTYNSLINGFCIHGCLGDAKYMFDLMVSKGCFPDVVTYNTLI 331
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
240-299 2.44e-07

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 47.74  E-value: 2.44e-07
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  240 RLLRDMVKRKIDPNVIFFTALIDTFVKEGNLLEARNLYKEMIRRSVVPNVFTYNSLINGF 299
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
170-227 4.03e-07

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 46.97  E-value: 4.03e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 334183626  170 SLVDSMDGFGFVPNVVIYNTVINGLCKNRDLNNALEVFYCMEKKGIRADAVTYNTLIS 227
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILG 58
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
210-267 4.36e-07

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 46.97  E-value: 4.36e-07
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 334183626  210 MEKKGIRADAVTYNTLISGLSNSGRWTDAARLLRDMVKRKIDPNVIFFTALIDTFVKE 267
Cdd:pfam13812   6 MVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
196-319 5.00e-07

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 50.47  E-value: 5.00e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  196 KNRDLNNALEVFYCMEKKGIRADAVTYNTLISGLSNSGRWTD------AARLLR---DMVKRKIDPNVIFFTALIDTFVK 266
Cdd:pfam17177  23 KHADATGALALYDAAKAEGVRLAQYHYNVLLYLCSKAADATDlkpqlaADRGFEvfeAMKAQGVSPNEATYTAVARLAAA 102
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|...
gi 334183626  267 EGNLLEARNLYKEMIRRSVVPNVFTYNSLINGFCIHGCLGDAKYMFDLMVSKG 319
Cdd:pfam17177 103 KGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHG 155
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
261-401 5.97e-07

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 50.47  E-value: 5.97e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  261 IDTFVKEGNLLEARNLYKEMIRRSVVPNVFTYNSLINgFCIHG----------CLGDAKYMFDLMVSKGCFPDVVTYNTL 330
Cdd:pfam17177  18 LDKCSKHADATGALALYDAAKAEGVRLAQYHYNVLLY-LCSKAadatdlkpqlAADRGFEVFEAMKAQGVSPNEATYTAV 96
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 334183626  331 ITgFCKSKRVED-GMKLFCEMTYQGLVGDAFTYNTLIHGYCQAGKLNVAQKVFNRMVDCGVSPDIVTYNILL 401
Cdd:pfam17177  97 AR-LAAAKGDGDlAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALL 167
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
261-511 1.03e-06

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 51.41  E-value: 1.03e-06
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 261 IDTFVKEGNLLEARNLYKEM-IRRSVVPNVFTYNSLINGFCIHGCLGDAKYMFDLMVSKGCFPDVVTYNTLITGFCKSKR 339
Cdd:PLN03081  94 IEKLVACGRHREALELFEILeAGCPFTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGM 173
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 340 VEDGMKLFCEMTYQGLvgdaFTYNTLIHGYCQAGKLNVAQKVFNRMV----DC--------------------------- 388
Cdd:PLN03081 174 LIDARRLFDEMPERNL----ASWGTIIGGLVDAGNYREAFALFREMWedgsDAeprtfvvmlrasaglgsaragqqlhcc 249
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626 389 ----GVSPDIVTYNILLDCLCNNGKIEKALVMVEDL-QKSemdvdIITYNIIIQGLCRTDKLKEAWCLFRSLTRKGVKPD 463
Cdd:PLN03081 250 vlktGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMpEKT-----TVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSID 324
                        250       260       270       280
                 ....*....|....*....|....*....|....*....|....*...
gi 334183626 464 AIAYITMISGLCRKGLQREADKLCRRMKEDGFmPSERIYDETLRDHYT 511
Cdd:PLN03081 325 QFTFSIMIRIFSRLALLEHAKQAHAGLIRTGF-PLDIVANTALVDLYS 371
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
213-246 1.32e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 44.64  E-value: 1.32e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 334183626  213 KGIRADAVTYNTLISGLSNSGRWTDAARLLRDMV 246
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
360-390 1.47e-06

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 44.76  E-value: 1.47e-06
                          10        20        30
                  ....*....|....*....|....*....|.
gi 334183626  360 FTYNTLIHGYCQAGKLNVAQKVFNRMVDCGV 390
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
360-394 1.56e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 44.75  E-value: 1.56e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 334183626  360 FTYNTLIHGYCQAGKLNVAQKVFNRMVDCGVSPDI 394
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
220-254 2.26e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 43.98  E-value: 2.26e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 334183626  220 VTYNTLISGLSNSGRWTDAARLLRDMVKRKIDPNV 254
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
185-215 4.29e-06

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 43.22  E-value: 4.29e-06
                          10        20        30
                  ....*....|....*....|....*....|.
gi 334183626  185 VIYNTVINGLCKNRDLNNALEVFYCMEKKGI 215
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
458-491 5.10e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 43.10  E-value: 5.10e-06
                          10        20        30
                  ....*....|....*....|....*....|....
gi 334183626  458 KGVKPDAIAYITMISGLCRKGLQREADKLCRRMK 491
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
430-464 5.47e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 43.21  E-value: 5.47e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 334183626  430 ITYNIIIQGLCRTDKLKEAWCLFRSLTRKGVKPDA 464
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
77-126 6.41e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 43.12  E-value: 6.41e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 334183626   77 PSIVDFTRVLTVIAKMNKFDIVIYLYHKMENLGISHDLYSFTILIHCFCR 126
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
325-355 1.11e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 42.07  E-value: 1.11e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 334183626  325 VTYNTLITGFCKSKRVEDGMKLFCEMTYQGL 355
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
290-320 1.21e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 42.07  E-value: 1.21e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 334183626  290 FTYNSLINGFCIHGCLGDAKYMFDLMVSKGC 320
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
325-350 4.56e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.52  E-value: 4.56e-05
                          10        20
                  ....*....|....*....|....*.
gi 334183626  325 VTYNTLITGFCKSKRVEDGMKLFCEM 350
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEM 26
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
395-429 6.06e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.13  E-value: 6.06e-05
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 334183626  395 VTYNILLDCLCNNGKIEKALVMVEDLQKSEMDVDI 429
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
462-502 1.52e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 39.27  E-value: 1.52e-04
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|.
gi 334183626  462 PDAIAYITMISGLCRKGLQREADKLCRRMKEDGFMPSERIY 502
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTY 41
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
220-250 1.63e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 38.99  E-value: 1.63e-04
                          10        20        30
                  ....*....|....*....|....*....|.
gi 334183626  220 VTYNTLISGLSNSGRWTDAARLLRDMVKRKI 250
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
257-289 2.35e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 38.59  E-value: 2.35e-04
                          10        20        30
                  ....*....|....*....|....*....|...
gi 334183626  257 FTALIDTFVKEGNLLEARNLYKEMIRRSVVPNV 289
Cdd:TIGR00756   3 YNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
380-436 2.70e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 39.26  E-value: 2.70e-04
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 334183626  380 KVFNRMVDCGVSPDIVTYNILLDCLCNNGKIEKALVMVEDLQKSEMDVDIITYNIII 436
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
185-219 3.22e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 38.21  E-value: 3.22e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 334183626  185 VIYNTVINGLCKNRDLNNALEVFYCMEKKGIRADA 219
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
144-175 3.76e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 37.71  E-value: 3.76e-04
                          10        20        30
                  ....*....|....*....|....*....|..
gi 334183626  144 GFRPSIVTLGSLLNGFCQGNRFQEAVSLVDSM 175
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
428-453 6.33e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 37.33  E-value: 6.33e-04
                          10        20
                  ....*....|....*....|....*.
gi 334183626  428 DIITYNIIIQGLCRTDKLKEAWCLFR 453
Cdd:pfam12854   6 DVVTYNTLINGLCRAGRVDEAFELLD 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
150-184 8.36e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.05  E-value: 8.36e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 334183626  150 VTLGSLLNGFCQGNRFQEAVSLVDSMDGFGFVPNV 184
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
430-460 1.35e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 36.29  E-value: 1.35e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 334183626  430 ITYNIIIQGLCRTDKLKEAWCLFRSLTRKGV 460
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
451-498 2.01e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 36.57  E-value: 2.01e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 334183626  451 LFRSLTRKGVKPDAIAYITMISGLCRKGLQREADKLCRRMKEDGFMPS 498
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPT 49
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
395-423 2.06e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.90  E-value: 2.06e-03
                          10        20
                  ....*....|....*....|....*....
gi 334183626  395 VTYNILLDCLCNNGKIEKALVMVEDLQKS 423
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEK 29
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
168-287 2.94e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 39.30  E-value: 2.94e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 334183626  168 AVSLVDSMDGFGFVPNVVIYNTVINgLC----------KNRDLNNALEVFYCMEKKGIRADAVTYNTLISGLSNSGRWTD 237
Cdd:pfam17177  30 ALALYDAAKAEGVRLAQYHYNVLLY-LCskaadatdlkPQLAADRGFEVFEAMKAQGVSPNEATYTAVARLAAAKGDGDL 108
                          90       100       110       120       130
                  ....*....|....*....|....*....|....*....|....*....|
gi 334183626  238 AARLLRDMVKRKIDPNVIFFTALIDTFVKEGNLLEARNLYKEMIRRSVVP 287
Cdd:pfam17177 109 AFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVEL 158
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
101-159 3.26e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 36.18  E-value: 3.26e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 334183626  101 LYHKMENLGISHDLYSFTILIHCFCRCSRLSLALALLGKMMKLGFRPSIVTLGSLLNGF 159
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
465-495 5.35e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 34.75  E-value: 5.35e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 334183626  465 IAYITMISGLCRKGLQREADKLCRRMKEDGF 495
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
257-285 6.45e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 34.36  E-value: 6.45e-03
                          10        20
                  ....*....|....*....|....*....
gi 334183626  257 FTALIDTFVKEGNLLEARNLYKEMIRRSV 285
Cdd:pfam01535   3 YNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
250-280 9.76e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 33.86  E-value: 9.76e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 334183626  250 IDPNVIFFTALIDTFVKEGNLLEARNLYKEM 280
Cdd:pfam12854   3 LKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH