NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|1063729576|ref|NP_001332239|]
View 

Pentatricopeptide repeat (PPR) superfamily protein [Arabidopsis thaliana]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PLN03218 super family cl33664
maturation of RBCL 1; Provisional
282-781 4.79e-38

maturation of RBCL 1; Provisional


The actual alignment was detected with superfamily member PLN03218:

Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 153.49  E-value: 4.79e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  282 YNTLItccKRGSLhQEAAQVFEEMKAAG-FSYDKVTYNALLDVYGKSHRPKEAMKvLNEMVLNgfsPSIVTYNSLISAYA 360
Cdd:PLN03218   377 YNRLL---RDGRI-KDCIDLLEDMEKRGlLDMDKIYHAKFFKACKKQRAVKEAFR-FAKLIRN---PTLSTFNMLMSVCA 448
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  361 RDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKftemmk 440
Cdd:PLN03218   449 SSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQ------ 522
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  441 ifdeinvcglspdivtwntLLAVFGQNG-MDSevsgvfKEMKragfvPERETFNTLISAYSRCGSFEQAMtvyrrmldag 519
Cdd:PLN03218   523 -------------------VAKAFGAYGiMRS------KNVK-----PDRVVFNALISACGQSGAVDRAF---------- 562
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  520 vtpdlstyntvlaalarggmweqseKVLAEM--EDGRCKPNELTYCSLLHAYANGKEIGlmhsLAEEVYSGV----IEPR 593
Cdd:PLN03218   563 -------------------------DVLAEMkaETHPIDPDHITVGALMKACANAGQVD----RAKEVYQMIheynIKGT 613
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  594 AVLLKTLVLVCSKCDLLPEAERAFSELKERGFSPDITTLNSMVSIYGRRQMVAKANGVLDYMKERGFTPSMATYNSLMYM 673
Cdd:PLN03218   614 PEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGA 693
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  674 HSRSADFGKSEEILREILAKGIKPDIISYNTVIYAYCRNTRMRDASRIFSEMRNSGIVPDVITYNTFIGSYAADSMFEEA 753
Cdd:PLN03218   694 CSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVG 773
                          490       500
                   ....*....|....*....|....*...
gi 1063729576  754 IGVVRYMIKHGCRPNQNTYNSIVdGYCK 781
Cdd:PLN03218   774 LDLLSQAKEDGIKPNLVMCRCIT-GLCL 800
PPR_3 super family cl37870
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
199-256 1.36e-16

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


The actual alignment was detected with superfamily member pfam13812:

Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 74.70  E-value: 1.36e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1063729576 199 LQEDGFSLDVYSYTSLISAFANSGRYREAVNVFKKMEEDGCKPTLITYNVILNVFGKM 256
Cdd:pfam13812   6 MVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
491-536 2.65e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 53.52  E-value: 2.65e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 1063729576 491 TFNTLISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAALAR 536
Cdd:pfam13041   5 TYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 super family cl38385
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
174-220 3.84e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


The actual alignment was detected with superfamily member pfam13041:

Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 41.58  E-value: 3.84e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1063729576 174 SVVA--IIISMLGKEGRVSSAANMFNGLQEDGFSLDVYSYTSLISAFAN 220
Cdd:pfam13041   2 DVVTynTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
282-781 4.79e-38

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 153.49  E-value: 4.79e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  282 YNTLItccKRGSLhQEAAQVFEEMKAAG-FSYDKVTYNALLDVYGKSHRPKEAMKvLNEMVLNgfsPSIVTYNSLISAYA 360
Cdd:PLN03218   377 YNRLL---RDGRI-KDCIDLLEDMEKRGlLDMDKIYHAKFFKACKKQRAVKEAFR-FAKLIRN---PTLSTFNMLMSVCA 448
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  361 RDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKftemmk 440
Cdd:PLN03218   449 SSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQ------ 522
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  441 ifdeinvcglspdivtwntLLAVFGQNG-MDSevsgvfKEMKragfvPERETFNTLISAYSRCGSFEQAMtvyrrmldag 519
Cdd:PLN03218   523 -------------------VAKAFGAYGiMRS------KNVK-----PDRVVFNALISACGQSGAVDRAF---------- 562
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  520 vtpdlstyntvlaalarggmweqseKVLAEM--EDGRCKPNELTYCSLLHAYANGKEIGlmhsLAEEVYSGV----IEPR 593
Cdd:PLN03218   563 -------------------------DVLAEMkaETHPIDPDHITVGALMKACANAGQVD----RAKEVYQMIheynIKGT 613
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  594 AVLLKTLVLVCSKCDLLPEAERAFSELKERGFSPDITTLNSMVSIYGRRQMVAKANGVLDYMKERGFTPSMATYNSLMYM 673
Cdd:PLN03218   614 PEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGA 693
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  674 HSRSADFGKSEEILREILAKGIKPDIISYNTVIYAYCRNTRMRDASRIFSEMRNSGIVPDVITYNTFIGSYAADSMFEEA 753
Cdd:PLN03218   694 CSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVG 773
                          490       500
                   ....*....|....*....|....*...
gi 1063729576  754 IGVVRYMIKHGCRPNQNTYNSIVdGYCK 781
Cdd:PLN03218   774 LDLLSQAKEDGIKPNLVMCRCIT-GLCL 800
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
371-432 2.11e-17

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 77.01  E-value: 2.11e-17
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1063729576 371 LKNQMAEKGTKPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNR 432
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
199-256 1.36e-16

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 74.70  E-value: 1.36e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1063729576 199 LQEDGFSLDVYSYTSLISAFANSGRYREAVNVFKKMEEDGCKPTLITYNVILNVFGKM 256
Cdd:pfam13812   6 MVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
491-536 2.65e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 53.52  E-value: 2.65e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 1063729576 491 TFNTLISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAALAR 536
Cdd:pfam13041   5 TYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
700-734 1.17e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 48.22  E-value: 1.17e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063729576 700 ISYNTVIYAYCRNTRMRDASRIFSEMRNSGIVPDV 734
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
209-242 6.16e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 46.29  E-value: 6.16e-07
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063729576 209 YSYTSLISAFANSGRYREAVNVFKKMEEDGCKPT 242
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
295-550 4.03e-06

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 49.34  E-value: 4.03e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 295 HQEAAQVFEEMkAAGFSYDKVTYNALLDVYGKSHRPKEAMKVLNEMV-LNGFSPSIvtYNSLISAYARDGMLDEAMELKN 373
Cdd:COG2956    24 PDKAIDLLEEA-LELDPETVEAHLALGNLYRRRGEYDRAIRIHQKLLeRDPDRAEA--LLELAQDYLKAGLLDRAEELLE 100
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 374 QMAEKGTKpDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGckPNictfNAFIKM-----YGNRGKFTEMMKIFDEinVC 448
Cdd:COG2956   101 KLLELDPD-DAEALRLLAEIYEQEGDWEKAIEVLERLLKLG--PE----NAHAYCelaelYLEQGDYDEAIEALEK--AL 171
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 449 GLSPDIVTWNTLLA-VFGQNGMDSEVSGVFKEMKRAGfvPER-ETFNTLISAYSRCGSFEQAMTVYRRMLDAgvTPDLST 526
Cdd:COG2956   172 KLDPDCARALLLLAeLYLEQGDYEEAIAALERALEQD--PDYlPALPRLAELYEKLGDPEEALELLRKALEL--DPSDDL 247
                         250       260
                  ....*....|....*....|....
gi 1063729576 527 YNTVLAALARGGMWEQSEKVLAEM 550
Cdd:COG2956   248 LLALADLLERKEGLEAALALLERQ 271
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
174-220 3.84e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 41.58  E-value: 3.84e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1063729576 174 SVVA--IIISMLGKEGRVSSAANMFNGLQEDGFSLDVYSYTSLISAFAN 220
Cdd:pfam13041   2 DVVTynTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
282-781 4.79e-38

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 153.49  E-value: 4.79e-38
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  282 YNTLItccKRGSLhQEAAQVFEEMKAAG-FSYDKVTYNALLDVYGKSHRPKEAMKvLNEMVLNgfsPSIVTYNSLISAYA 360
Cdd:PLN03218   377 YNRLL---RDGRI-KDCIDLLEDMEKRGlLDMDKIYHAKFFKACKKQRAVKEAFR-FAKLIRN---PTLSTFNMLMSVCA 448
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  361 RDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKftemmk 440
Cdd:PLN03218   449 SSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQ------ 522
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  441 ifdeinvcglspdivtwntLLAVFGQNG-MDSevsgvfKEMKragfvPERETFNTLISAYSRCGSFEQAMtvyrrmldag 519
Cdd:PLN03218   523 -------------------VAKAFGAYGiMRS------KNVK-----PDRVVFNALISACGQSGAVDRAF---------- 562
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  520 vtpdlstyntvlaalarggmweqseKVLAEM--EDGRCKPNELTYCSLLHAYANGKEIGlmhsLAEEVYSGV----IEPR 593
Cdd:PLN03218   563 -------------------------DVLAEMkaETHPIDPDHITVGALMKACANAGQVD----RAKEVYQMIheynIKGT 613
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  594 AVLLKTLVLVCSKCDLLPEAERAFSELKERGFSPDITTLNSMVSIYGRRQMVAKANGVLDYMKERGFTPSMATYNSLMYM 673
Cdd:PLN03218   614 PEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGA 693
                          410       420       430       440       450       460       470       480
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  674 HSRSADFGKSEEILREILAKGIKPDIISYNTVIYAYCRNTRMRDASRIFSEMRNSGIVPDVITYNTFIGSYAADSMFEEA 753
Cdd:PLN03218   694 CSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVG 773
                          490       500
                   ....*....|....*....|....*...
gi 1063729576  754 IGVVRYMIKHGCRPNQNTYNSIVdGYCK 781
Cdd:PLN03218   774 LDLLSQAKEDGIKPNLVMCRCIT-GLCL 800
PLN03218 PLN03218
maturation of RBCL 1; Provisional
199-563 9.52e-33

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 136.55  E-value: 9.52e-33
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  199 LQEDGFSLDVYSYTSLISAFANSGRYREAVNVFKKMEEDGCKPTLITYNVILNVFGKMGTpwnkitslVEK-------MK 271
Cdd:PLN03218   463 VQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQ--------VAKafgaygiMR 534
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  272 SDGIAPDAYTYNTLITCCKRGSLHQEAAQVFEEMKAagfsydkvtynalldvygkshrpkEAMKVLnemvlngfsPSIVT 351
Cdd:PLN03218   535 SKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKA------------------------ETHPID---------PDHIT 581
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  352 YNSLISAYARDGMLDEAMELKNQMAE---KGTkPDVftYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKM 428
Cdd:PLN03218   582 VGALMKACANAGQVDRAKEVYQMIHEyniKGT-PEV--YTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDV 658
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  429 YGNRGKFTEMMKIFDEINVCGLSPDIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETFNTLISAYSRCGSFEQA 508
Cdd:PLN03218   659 AGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKA 738
                          330       340       350       360       370
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 1063729576  509 MTVYRRMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMEDGRCKPNeLTYC 563
Cdd:PLN03218   739 LEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDLLSQAKEDGIKPN-LVMC 792
PLN03077 PLN03077
Protein ECB2; Provisional
213-761 1.00e-27

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 120.34  E-value: 1.00e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 213 SLISAFANSGRYREAVNVFKKMEEDgckpTLITYNVILNVFGKMGTpWNKITSLVEKMKSDGIAPDAYTYNTLITCCKRG 292
Cdd:PLN03077  126 AMLSMFVRFGELVHAWYVFGKMPER----DLFSWNVLVGGYAKAGY-FDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGI 200
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 293 SLHQEAAQVFEEMKAAGFSYDKVTYNALLDVYGKSHRPKEAMKVLNEMVLNgfspSIVTYNSLISAYARDGMLDEAMELK 372
Cdd:PLN03077  201 PDLARGREVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDRMPRR----DCISWNAMISGYFENGECLEGLELF 276
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 373 NQMAEKGTKPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKFTEMMKIFDEINvcglSP 452
Cdd:PLN03077  277 FTMRELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRME----TK 352
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 453 DIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETF-----------------------------------NTLIS 497
Cdd:PLN03077  353 DAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIasvlsacaclgdldvgvklhelaerkglisyvvvaNALIE 432
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 498 AYSRCGSFEQAMTVYRRMLDAgvtpDLSTYNTVLAALARGGMWEQSEKVLAEMEdGRCKPNELTYCSLLHAYAN------ 571
Cdd:PLN03077  433 MYSKCKCIDKALEVFHNIPEK----DVISWTSIIAGLRLNNRCFEALIFFRQML-LTLKPNSVTLIAALSACARigalmc 507
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 572 GKEIGlMHSLAEEVYSGVIEPRAvLLKTLVlvcsKCDLLPEAERAFSELKErgfspDITTLNSMVSIYGRRQMVAKANGV 651
Cdd:PLN03077  508 GKEIH-AHVLRTGIGFDGFLPNA-LLDLYV----RCGRMNYAWNQFNSHEK-----DVVSWNILLTGYVAHGKGSMAVEL 576
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 652 LDYMKERGFTPSMATYNSLMYMHSRSADFGKSEEILREILAK-GIKPDIISYNTVIYAYCRNTRMRDASRIFSEMRnsgI 730
Cdd:PLN03077  577 FNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEEKySITPNLKHYACVVDLLGRAGKLTEAYNFINKMP---I 653
                         570       580       590       600
                  ....*....|....*....|....*....|....*....|..
gi 1063729576 731 VPDV-----------ITYNTFIGSYAADSMFEEAIGVVRYMI 761
Cdd:PLN03077  654 TPDPavwgallnacrIHRHVELGELAAQHIFELDPNSVGYYI 695
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
122-626 1.22e-27

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 119.59  E-value: 1.22e-27
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 122 ELFE------PFKdKPESTSSELLAFLKGLgfhKKFDLALRAFdWFMKQKDYQsmLDNSVVAIIISMLGKEGRVSSAANM 195
Cdd:PLN03081  108 ELFEileagcPFT-LPASTYDALVEACIAL---KSIRCVKAVY-WHVESSGFE--PDQYMMNRVLLMHVKCGMLIDARRL 180
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 196 FNGLQEDgfslDVYSYTSLISAFANSGRYREAVNVFKKMEEDGCKPTLITYNVILNVFGKMGTpwnkitslvekmksdgi 275
Cdd:PLN03081  181 FDEMPER----NLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLGS----------------- 239
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 276 apdAYTYNTLITCCKRgslhqeaaqvfeemkaAGFSYDKVTYNALLDVYGKSHRPKEAMKVLNEMVlngfSPSIVTYNSL 355
Cdd:PLN03081  240 ---ARAGQQLHCCVLK----------------TGVVGDTFVSCALIDMYSKCGDIEDARCVFDGMP----EKTTVAWNSM 296
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 356 ISAYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKF 435
Cdd:PLN03081  297 LAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRM 376
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 436 TEMMKIFDEINVcglsPDIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETFNTLISAYSRCGSFEQAMTVYRRM 515
Cdd:PLN03081  377 EDARNVFDRMPR----KNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSM 452
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 516 L-DAGVTPDLSTYNTVLAALARGGMWEQSekvLAEMEDGRCKPNELTYCSLLHAYANGKEIGLMHSLAEEVYSgvIEPRA 594
Cdd:PLN03081  453 SeNHRIKPRAMHYACMIELLGREGLLDEA---YAMIRRAPFKPTVNMWAALLTACRIHKNLELGRLAAEKLYG--MGPEK 527
                         490       500       510
                  ....*....|....*....|....*....|....*.
gi 1063729576 595 ----VLLKTLVLVCSKCDllpEAERAFSELKERGFS 626
Cdd:PLN03081  528 lnnyVVLLNLYNSSGRQA---EAAKVVETLKRKGLS 560
PLN03077 PLN03077
Protein ECB2; Provisional
266-776 2.37e-25

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 112.64  E-value: 2.37e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 266 LVEKMKSDGIAPDAYTYNTLITCCKRGSLHQEAAQVFEEMKAAGFSYDKVTYNALLDVYGKSHRPKEAMKVLNEMV-LNG 344
Cdd:PLN03077   73 LLESMQELRVPVDEDAYVALFRLCEWKRAVEEGSRVCSRALSSHPSLGVRLGNAMLSMFVRFGELVHAWYVFGKMPeRDL 152
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 345 FSpsivtYNSLISAYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNA 424
Cdd:PLN03077  153 FS-----WNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGREVHAHVVRFGFELDVDVVNA 227
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 425 FIKMYGNRGKFTEMMKIFDEINVcglsPDIVTWNTLLAVFGQNGMDSEVSGVFKEMK----------------------- 481
Cdd:PLN03077  228 LITMYVKCGDVVSARLVFDRMPR----RDCISWNAMISGYFENGECLEGLELFFTMRelsvdpdlmtitsvisacellgd 303
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 482 ------------RAGFVPERETFNTLISAYSRCGSFEQAMTVYRRMldagVTPDLSTYNTVLAALARGGMWEQSEKVLAE 549
Cdd:PLN03077  304 erlgremhgyvvKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRM----ETKDAVSWTAMISGYEKNGLPDKALETYAL 379
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 550 MEDGRCKPNELTYCSLLHAYAN-GK-EIGLM-HSLAEEvySGVIEpRAVLLKTLVLVCSKCDLLPEAERAFSELKERgfs 626
Cdd:PLN03077  380 MEQDNVSPDEITIASVLSACAClGDlDVGVKlHELAER--KGLIS-YVVVANALIEMYSKCKCIDKALEVFHNIPEK--- 453
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 627 pDITTLNSMV-----------SIYGRRQMVAKANgvldymkergftPSMATYNSLMYMHSRSADFGKSEEILREILAKGI 695
Cdd:PLN03077  454 -DVISWTSIIaglrlnnrcfeALIFFRQMLLTLK------------PNSVTLIAALSACARIGALMCGKEIHAHVLRTGI 520
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 696 KPDIISYNTVIYAYCRNTRMRDASRIFSEMRNsgivpDVITYNTFIGSYAADSMFEEAIGVVRYMIKHGCRPNQNTYNSI 775
Cdd:PLN03077  521 GFDGFLPNALLDLYVRCGRMNYAWNQFNSHEK-----DVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISL 595

                  .
gi 1063729576 776 V 776
Cdd:PLN03077  596 L 596
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
289-794 2.81e-25

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 111.89  E-value: 2.81e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 289 CKRgslHQEAAQVFEEMKAAG-FSYDKVTYNALLDVYGKSHRPKEAMKVLNEMVLNGFSPSIVTYNSLISAYARDGMLDE 367
Cdd:PLN03081  100 CGR---HREALELFEILEAGCpFTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLID 176
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 368 AMELKNQMAEKgtkpDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGkfteMMKIFDEINV 447
Cdd:PLN03081  177 ARRLFDEMPER----NLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVMLRASAGLG----SARAGQQLHC 248
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 448 CGLspdivtwntllavfgqngmdsevsgvfkemkRAGFVPERETFNTLISAYSRCGSFEQAMTVYRRMldagvtPDLST- 526
Cdd:PLN03081  249 CVL-------------------------------KTGVVGDTFVSCALIDMYSKCGDIEDARCVFDGM------PEKTTv 291
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 527 -YNTVLAALARGGMWEQSEKVLAEMEDGRCKPNELTycsllhayangkeiglmhslaeevysgviepravlLKTLVLVCS 605
Cdd:PLN03081  292 aWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFT-----------------------------------FSIMIRIFS 336
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 606 KCDLLPEAERAFSELKERGFSPDITTLNSMVSIYGRRQMVAKANGVLDYMKERgftpsmatynslmymhsrsadfgksee 685
Cdd:PLN03081  337 RLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRK--------------------------- 389
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 686 ilreilakgikpDIISYNTVIYAYCRNTRMRDASRIFSEMRNSGIVPDVITYNTFIGSYAADSMFEEAIGVVRYMIK-HG 764
Cdd:PLN03081  390 ------------NLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSEnHR 457
                         490       500       510
                  ....*....|....*....|....*....|
gi 1063729576 765 CRPNQNTYNSIVDGYCKLNRKDEAKLFVED 794
Cdd:PLN03081  458 IKPRAMHYACMIELLGREGLLDEAYAMIRR 487
PLN03218 PLN03218
maturation of RBCL 1; Provisional
172-420 2.22e-22

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 103.42  E-value: 2.22e-22
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  172 DNSVVAIIISMLGKEGRVSSAanmFNGLQEDG-----FSLDVYSYTSLISAFANSGRYREAVNVFKKMEEDGCKPTLITY 246
Cdd:PLN03218   541 DRVVFNALISACGQSGAVDRA---FDVLAEMKaethpIDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVY 617
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  247 NVILNVFGKMGTpWNKITSLVEKMKSDGIAPDAYTYNTLITCCKRGSLHQEAAQVFEEMKAAGFSYDKVTYNALLDVYGK 326
Cdd:PLN03218   618 TIAVNSCSQKGD-WDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSN 696
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576  327 SHRPKEAMKVLNEMVLNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKVESAMSI 406
Cdd:PLN03218   697 AKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILLVASERKDDADVGLDL 776
                          250
                   ....*....|....*.
gi 1063729576  407 FEEMRNAGCKPN--IC 420
Cdd:PLN03218   777 LSQAKEDGIKPNlvMC 792
PLN03077 PLN03077
Protein ECB2; Provisional
171-552 9.33e-22

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 101.08  E-value: 9.33e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 171 LDNSVVAIIISMLGKEGRVSSAANMFNGLQedgfSLDVYSYTSLISAFANSGRYREAVNVFKKMEEDGCKPTLITYNVIL 250
Cdd:PLN03077  220 LDVDVVNALITMYVKCGDVVSARLVFDRMP----RRDCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITSVI 295
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 251 NVFGKMG--TPWNKITSLVEKMksdGIAPDAYTYNTLITCCKRGSLHQEAAQVFEEMKAAgfsyDKVTYNALLDVYGKSH 328
Cdd:PLN03077  296 SACELLGdeRLGREMHGYVVKT---GFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRMETK----DAVSWTAMISGYEKNG 368
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 329 RPKEAMKVLNEMVLNGFSPSIVTY-----------------------------------NSLISAYARDGMLDEAMELKN 373
Cdd:PLN03077  369 LPDKALETYALMEQDNVSPDEITIasvlsacaclgdldvgvklhelaerkglisyvvvaNALIEMYSKCKCIDKALEVFH 448
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 374 QMAEKgtkpDVFTYTTLLSGFERAGKVESAMSIFEEMRnAGCKPNICTFNAFIKMYGNRGKFTEMMKIFDEINVCGLSPD 453
Cdd:PLN03077  449 NIPEK----DVISWTSIIAGLRLNNRCFEALIFFRQML-LTLKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGFD 523
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 454 IVTWNTLLAVFGQNGMDSEVSGVFKEMKRagfvpERETFNTLISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAA 533
Cdd:PLN03077  524 GFLPNALLDLYVRCGRMNYAWNQFNSHEK-----DVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCA 598
                         410
                  ....*....|....*....
gi 1063729576 534 LARGGMWEQSEKVLAEMED 552
Cdd:PLN03077  599 CSRSGMVTQGLEYFHSMEE 617
PLN03077 PLN03077
Protein ECB2; Provisional
398-786 2.72e-19

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 93.38  E-value: 2.72e-19
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 398 GKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNRGKFTEMMKifdeinVCGLSPDIVTW------NTLLAVFGQNGMDS 471
Cdd:PLN03077   65 GQLEQALKLLESMQELRVPVDEDAYVALFRLCEWKRAVEEGSR------VCSRALSSHPSlgvrlgNAMLSMFVRFGELV 138
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 472 EVSGVFKEMkragfvPERETF--NTLISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAAlarggmweqsekvlae 549
Cdd:PLN03077  139 HAWYVFGKM------PERDLFswNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRT---------------- 196
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 550 medgrckpneltyCSLLHAYANGKEIG---LMHSLAEEVYsgviepravLLKTLVLVCSKCDLLPEAERAFSELKERgfs 626
Cdd:PLN03077  197 -------------CGGIPDLARGREVHahvVRFGFELDVD---------VVNALITMYVKCGDVVSARLVFDRMPRR--- 251
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 627 pDITTLNSMVSIYGRRQMVAKANGVLDYMKERGFTPSMATYNSLMYMHSRSADFGKSEEILREILAKGIKPDIISYNTVI 706
Cdd:PLN03077  252 -DCISWNAMISGYFENGECLEGLELFFTMRELSVDPDLMTITSVISACELLGDERLGREMHGYVVKTGFAVDVSVCNSLI 330
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 707 YAYCRNTRMRDASRIFSEMRNSgivpDVITYNTFIGSYAADSMFEEAIGVVRYMIKHGCRPNQNTYNSIVDGYCKLNRKD 786
Cdd:PLN03077  331 QMYLSLGSWGEAEKVFSRMETK----DAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGDLD 406
PLN03077 PLN03077
Protein ECB2; Provisional
172-523 1.92e-17

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 87.21  E-value: 1.92e-17
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 172 DNSVVAIIISMLGKEGRVSSAANMFNGLQedgfSLDVYSYTSLISAFANSGRYREAVNVFKKMEEDGCKPT--------- 242
Cdd:PLN03077  322 DVSVCNSLIQMYLSLGSWGEAEKVFSRME----TKDAVSWTAMISGYEKNGLPDKALETYALMEQDNVSPDeitiasvls 397
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 243 ----------------------LITYNVILNVFGKMGTPWNKI------------------TSLV--------------- 267
Cdd:PLN03077  398 acaclgdldvgvklhelaerkgLISYVVVANALIEMYSKCKCIdkalevfhnipekdviswTSIIaglrlnnrcfealif 477
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 268 -EKMKSDgIAPDAYTYNTLITCCKRGSLHQEAAQVFEEMKAAGFSYDKVTYNALLDVYGKSHRPKEAmkvLNEmvLNGFS 346
Cdd:PLN03077  478 fRQMLLT-LKPNSVTLIAALSACARIGALMCGKEIHAHVLRTGIGFDGFLPNALLDLYVRCGRMNYA---WNQ--FNSHE 551
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 347 PSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKVESAMSIFEEMR-NAGCKPNICTFNAF 425
Cdd:PLN03077  552 KDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHSMEeKYSITPNLKHYACV 631
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 426 IKMYGNRGKFTEMMKIFDEINVcglSPDIVTWNTLLAV--FGQNGMDSEVSG--VFK-EMKRAGFvperetFNTLISAYS 500
Cdd:PLN03077  632 VDLLGRAGKLTEAYNFINKMPI---TPDPAVWGALLNAcrIHRHVELGELAAqhIFElDPNSVGY------YILLCNLYA 702
                         410       420
                  ....*....|....*....|...
gi 1063729576 501 RCGSFEQAMTVYRRMLDAGVTPD 523
Cdd:PLN03077  703 DAGKWDEVARVRKTMRENGLTVD 725
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
371-432 2.11e-17

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 77.01  E-value: 2.11e-17
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1063729576 371 LKNQMAEKGTKPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGNR 432
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
199-256 1.36e-16

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 74.70  E-value: 1.36e-16
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1063729576 199 LQEDGFSLDVYSYTSLISAFANSGRYREAVNVFKKMEEDGCKPTLITYNVILNVFGKM 256
Cdd:pfam13812   6 MVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
347-394 3.07e-16

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 73.17  E-value: 3.07e-16
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 1063729576 347 PSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGF 394
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGL 48
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
697-745 7.54e-14

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 66.23  E-value: 7.54e-14
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1063729576 697 PDIISYNTVIYAYCRNTRMRDASRIFSEMRNSGIVPDVITYNTFIGSYA 745
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
382-431 1.71e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 65.46  E-value: 1.71e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063729576 382 PDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNICTFNAFIKMYGN 431
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
265-413 3.47e-13

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 69.35  E-value: 3.47e-13
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 265 SLVEKMKSDGIAPDAYTYNTLITCCKRG----SLHQEAA-----QVFEEMKAAGFSYDKVTYNALLDVYGKSHRPKEAMK 335
Cdd:pfam17177  32 ALYDAAKAEGVRLAQYHYNVLLYLCSKAadatDLKPQLAadrgfEVFEAMKAQGVSPNEATYTAVARLAAAKGDGDLAFD 111
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1063729576 336 VLNEMVLNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKVESAMSIFEEMRNA 413
Cdd:pfam17177 112 LVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAGRADKVYAYLHRLRDA 189
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
315-361 5.37e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 63.92  E-value: 5.37e-13
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 1063729576 315 VTYNALLDVYGKSHRPKEAMKVLNEMVLNGFSPSIVTYNSLISAYAR 361
Cdd:pfam13041   4 VTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
732-781 7.70e-13

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 63.54  E-value: 7.70e-13
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063729576 732 PDVITYNTFIGSYAADSMFEEAIGVVRYMIKHGCRPNQNTYNSIVDGYCK 781
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
686-744 6.18e-12

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 61.22  E-value: 6.18e-12
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1063729576 686 ILREILAKGIKPDIISYNTVIYAYCRNTRMRDASRIFSEMRNSGIVPDVITYNTFIGSY 744
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
207-255 1.01e-11

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 60.45  E-value: 1.01e-11
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1063729576 207 DVYSYTSLISAFANSGRYREAVNVFKKMEEDGCKPTLITYNVILNVFGK 255
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
269-417 3.05e-11

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 63.57  E-value: 3.05e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 269 KMKSDGIAPDAYTYNTLITCCKRGSLHQeAAQVFEEMKAAGFSYDKVTYNALLDVYGKSHRPKEAMK---------VLNE 339
Cdd:pfam17177   2 RKKKGKQTPESELRFQLDKCSKHADATG-ALALYDAAKAEGVRLAQYHYNVLLYLCSKAADATDLKPqlaadrgfeVFEA 80
                          90       100       110       120       130       140       150
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1063729576 340 MVLNGFSPSIVTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGCKP 417
Cdd:pfam17177  81 MKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVEL 158
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
265-326 5.84e-11

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 58.52  E-value: 5.84e-11
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1063729576 265 SLVEKMKSDGIAPDAYTYNTLITCCKRGSLHQEAAQVFEEMKAAGFSYDKVTYNALLDVYGK 326
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGG 62
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
301-359 3.30e-10

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 56.60  E-value: 3.30e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 1063729576 301 VFEEMKAAGFSYDKVTYNALLDVYGKSHRPKEAMKVLNEMVLNGFSPSIVTYNSLISAY 359
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
522-571 7.07e-10

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 55.06  E-value: 7.07e-10
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063729576 522 PDLSTYNTVLAALARGGMWEQSEKVLAEMEDGRCKPNELTYCSLLHAYAN 571
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
378-411 2.23e-09

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 53.12  E-value: 2.23e-09
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063729576 378 KGTKPDVFTYTTLLSGFERAGKVESAMSIFEEMR 411
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
693-726 2.23e-09

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 53.12  E-value: 2.23e-09
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063729576 693 KGIKPDIISYNTVIYAYCRNTRMRDASRIFSEMR 726
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
491-536 2.65e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 53.52  E-value: 2.65e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*.
gi 1063729576 491 TFNTLISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAALAR 536
Cdd:pfam13041   5 TYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
343-375 2.66e-09

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 53.12  E-value: 2.66e-09
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1063729576 343 NGFSPSIVTYNSLISAYARDGMLDEAMELKNQM 375
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
452-501 1.47e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 51.21  E-value: 1.47e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063729576 452 PDIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETFNTLISAYSR 501
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
277-326 1.97e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 50.82  E-value: 1.97e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1063729576 277 PDAYTYNTLIT-CCKRGsLHQEAAQVFEEMKAAGFSYDKVTYNALLDVYGK 326
Cdd:pfam13041   1 PDVVTYNTLINgYCKKG-KVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
241-290 2.33e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 50.82  E-value: 2.33e-08
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|.
gi 1063729576 241 PTLITYNVILNVFGKMGTpWNKITSLVEKMKSDGIAPDAYTYNTLIT-CCK 290
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGK-VEEAFKLFNEMKKRGVKPNVYTYTILINgLCK 50
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
440-499 4.90e-08

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 50.43  E-value: 4.90e-08
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 440 KIFDEINVCGLSPDIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETFNTLISAY 499
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
417-461 6.65e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 49.28  E-value: 6.65e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 1063729576 417 PNICTFNAFIKMYGNRGKFTEMMKIFDEINVCGLSPDIVTWNTLL 461
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILI 45
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
720-779 1.15e-07

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 49.28  E-value: 1.15e-07
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 720 RIFSEMRNSGIVPDVITYNTFIGSYAADSMFEEAIGVVRYMIKHGCRPNQNTYNSIVDGY 779
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVI 60
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
700-734 1.17e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 48.22  E-value: 1.17e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063729576 700 ISYNTVIYAYCRNTRMRDASRIFSEMRNSGIVPDV 734
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
350-384 1.49e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 48.22  E-value: 1.49e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063729576 350 VTYNSLISAYARDGMLDEAMELKNQMAEKGTKPDV 384
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
385-419 1.68e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 47.84  E-value: 1.68e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063729576 385 FTYTTLLSGFERAGKVESAMSIFEEMRNAGCKPNI 419
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
273-306 2.55e-07

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 47.34  E-value: 2.55e-07
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063729576 273 DGIAPDAYTYNTLITC-CKRGSLHqEAAQVFEEMK 306
Cdd:pfam12854   1 KGLKPDVVTYNTLINGlCRAGRVD-EAFELLDEME 34
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
511-571 4.76e-07

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 47.35  E-value: 4.76e-07
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 1063729576 511 VYRRMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMEDGRCKPNELTYCSLLHAYAN 571
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGG 62
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
350-379 4.82e-07

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 46.69  E-value: 4.82e-07
                          10        20        30
                  ....*....|....*....|....*....|
gi 1063729576 350 VTYNSLISAYARDGMLDEAMELKNQMAEKG 379
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKG 30
PLN03077 PLN03077
Protein ECB2; Provisional
528-791 5.93e-07

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 53.31  E-value: 5.93e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 528 NTVLAALARGGMWEQSEKVLAEMEDGRCKPNELTYCSLL------HAYANGKEIgLMHSLAEEVYSGVIEPRAvLLKTLV 601
Cdd:PLN03077   55 NSQLRALCSHGQLEQALKLLESMQELRVPVDEDAYVALFrlcewkRAVEEGSRV-CSRALSSHPSLGVRLGNA-MLSMFV 132
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 602 lvcsKCDLLPEAERAFSELKERgfspDITTLNSMVSIYGRRQMVAKA--------------------------NGVLDYM 655
Cdd:PLN03077  133 ----RFGELVHAWYVFGKMPER----DLFSWNVLVGGYAKAGYFDEAlclyhrmlwagvrpdvytfpcvlrtcGGIPDLA 204
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 656 KER---------GFTPSMATYNSLMYMHSRSADFGKSeeilREILAKGIKPDIISYNTVIYAYCRNTRMRDASRIFSEMR 726
Cdd:PLN03077  205 RGRevhahvvrfGFELDVDVVNALITMYVKCGDVVSA----RLVFDRMPRRDCISWNAMISGYFENGECLEGLELFFTMR 280
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 1063729576 727 NSGIVPDVITYNTFIGsyAADSMFEEAIG--VVRYMIKHGCRPNQNTYNSIVDGYCKLNRKDEA-KLF 791
Cdd:PLN03077  281 ELSVDPDLMTITSVIS--ACELLGDERLGreMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAeKVF 346
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
209-242 6.16e-07

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 46.29  E-value: 6.16e-07
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063729576 209 YSYTSLISAFANSGRYREAVNVFKKMEEDGCKPT 242
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
627-671 8.02e-07

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 46.20  E-value: 8.02e-07
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*
gi 1063729576 627 PDITTLNSMVSIYGRRQMVAKANGVLDYMKERGFTPSMATYNSLM 671
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILI 45
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
396-585 1.02e-06

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 50.47  E-value: 1.02e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 396 RAGKVESAMSIFEEMRNAGCKPNICTFNAFIkmygnrgkftemmkifdeiNVCGLSPDIVtwntllAVFGQNGMDsEVSG 475
Cdd:pfam17177  23 KHADATGALALYDAAKAEGVRLAQYHYNVLL-------------------YLCSKAADAT------DLKPQLAAD-RGFE 76
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 476 VFKEMKRAGFVPERETFNTLISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAALARGGMWEQSEKVLAEMEDGRC 555
Cdd:pfam17177  77 VFEAMKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGV 156
                         170       180       190
                  ....*....|....*....|....*....|....
gi 1063729576 556 KPNELTYCSLLHAYAN-GKE---IGLMHSLAEEV 585
Cdd:pfam17177 157 ELEEPELAALLKVSAKaGRAdkvYAYLHRLRDAV 190
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
476-533 1.21e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 46.20  E-value: 1.21e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*...
gi 1063729576 476 VFKEMKRAGFVPERETFNTLISAYSRCGSFEQAMTVYRRMLDAGVTPDLSTYNTVLAA 533
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGV 59
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
315-349 1.30e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 45.52  E-value: 1.30e-06
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063729576 315 VTYNALLDVYGKSHRPKEAMKVLNEMVLNGFSPSI 349
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
209-239 1.53e-06

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 45.15  E-value: 1.53e-06
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063729576 209 YSYTSLISAFANSGRYREAVNVFKKMEEDGC 239
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
662-711 1.81e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 45.43  E-value: 1.81e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 1063729576 662 PSMATYNSLMYMHSRSADFGKSEEILREILAKGIKPDIISYNTVIYAYCR 711
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
491-523 3.23e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 44.37  E-value: 3.23e-06
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1063729576 491 TFNTLISAYSRCGSFEQAMTVYRRMLDAGVTPD 523
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
295-550 4.03e-06

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 49.34  E-value: 4.03e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 295 HQEAAQVFEEMkAAGFSYDKVTYNALLDVYGKSHRPKEAMKVLNEMV-LNGFSPSIvtYNSLISAYARDGMLDEAMELKN 373
Cdd:COG2956    24 PDKAIDLLEEA-LELDPETVEAHLALGNLYRRRGEYDRAIRIHQKLLeRDPDRAEA--LLELAQDYLKAGLLDRAEELLE 100
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 374 QMAEKGTKpDVFTYTTLLSGFERAGKVESAMSIFEEMRNAGckPNictfNAFIKM-----YGNRGKFTEMMKIFDEinVC 448
Cdd:COG2956   101 KLLELDPD-DAEALRLLAEIYEQEGDWEKAIEVLERLLKLG--PE----NAHAYCelaelYLEQGDYDEAIEALEK--AL 171
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 449 GLSPDIVTWNTLLA-VFGQNGMDSEVSGVFKEMKRAGfvPER-ETFNTLISAYSRCGSFEQAMTVYRRMLDAgvTPDLST 526
Cdd:COG2956   172 KLDPDCARALLLLAeLYLEQGDYEEAIAALERALEQD--PDYlPALPRLAELYEKLGDPEEALELLRKALEL--DPSDDL 247
                         250       260
                  ....*....|....*....|....
gi 1063729576 527 YNTVLAALARGGMWEQSEKVLAEM 550
Cdd:COG2956   248 LLALADLLERKEGLEAALALLERQ 271
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
385-415 4.50e-06

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 43.61  E-value: 4.50e-06
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063729576 385 FTYTTLLSGFERAGKVESAMSIFEEMRNAGC 415
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
360-530 5.04e-06

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 48.16  E-value: 5.04e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 360 ARDGMLDEAMELKNQMAEKGTKPDVFTYTTLL-----SGFERAGKVESAMS----IFEEMRNAGCKPNICTFNAFIKMYG 430
Cdd:pfam17177  22 SKHADATGALALYDAAKAEGVRLAQYHYNVLLylcskAADATDLKPQLAADrgfeVFEAMKAQGVSPNEATYTAVARLAA 101
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 431 NRGKFTEMMKIFDEINVCGLSPDIVTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVPERETFNTLISAYSRCGsfeQAMT 510
Cdd:pfam17177 102 AKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKAG---RADK 178
                         170       180
                  ....*....|....*....|...
gi 1063729576 511 VYR---RMLDAGVTPDLSTYNTV 530
Cdd:pfam17177 179 VYAylhRLRDAVRQVSESTAGVL 201
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
615-671 5.16e-06

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 44.66  E-value: 5.16e-06
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 1063729576 615 RAFSELKERGFSPDITTLNSMVSIYGRRQMVAKANGVLDYMKERGFTPSMATYNSLM 671
Cdd:pfam13812   1 SILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAIL 57
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
763-793 5.26e-06

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 43.49  E-value: 5.26e-06
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1063729576 763 HGCRPNQNTYNSIVDGYCKLNRKDEA-KLFVE 793
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAfELLDE 32
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
656-788 5.96e-06

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 48.16  E-value: 5.96e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 656 KERGFTPSmATYNSLMYMHSRSADFGKSEEILREILAKGIKPDIISYNTVIY---------AYCRNTRMRDASRIFSEMR 726
Cdd:pfam17177   4 KKGKQTPE-SELRFQLDKCSKHADATGALALYDAAKAEGVRLAQYHYNVLLYlcskaadatDLKPQLAADRGFEVFEAMK 82
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1063729576 727 NSGIVPDVITYNTFIGSYAADSMFEEAIGVVRYMIKHGCRPNQNTYNSIVDGYCKLNRKDEA 788
Cdd:pfam17177  83 AQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKA 144
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
651-732 1.06e-05

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 47.39  E-value: 1.06e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 651 VLDYMKERGFTPSMATYNSLMYMHSRSADFGKSEEILREILAKGIKPDIISYNTVIYAYCRNTRMRDASRIFSEMRNSGI 730
Cdd:pfam17177  77 VFEAMKAQGVSPNEATYTAVARLAAAKGDGDLAFDLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGV 156

                  ..
gi 1063729576 731 VP 732
Cdd:pfam17177 157 EL 158
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
651-712 1.22e-05

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the case of the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in this N-terminal region, of protein-only RNase P (PRORP) enzymes, form the pentatricopeptide repeat (PPR) domain which enhances pre-tRNA binding affinity. PROPRP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 43.50  E-value: 1.22e-05
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1063729576 651 VLDYMKERGFTPSMATYNSLMYMHSRSADFGKSEEILREILAKGIKPDIISYNTVIYAYCRN 712
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGGR 63
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
202-235 1.34e-05

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 42.33  E-value: 1.34e-05
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063729576 202 DGFSLDVYSYTSLISAFANSGRYREAVNVFKKME 235
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
315-345 1.87e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 42.07  E-value: 1.87e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063729576 315 VTYNALLDVYGKSHRPKEAMKVLNEMVLNGF 345
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
174-220 3.84e-05

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 41.58  E-value: 3.84e-05
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*....
gi 1063729576 174 SVVA--IIISMLGKEGRVSSAANMFNGLQEDGFSLDVYSYTSLISAFAN 220
Cdd:pfam13041   2 DVVTynTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
700-730 5.03e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 40.91  E-value: 5.03e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063729576 700 ISYNTVIYAYCRNTRMRDASRIFSEMRNSGI 730
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
491-520 5.94e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 40.53  E-value: 5.94e-05
                          10        20        30
                  ....*....|....*....|....*....|
gi 1063729576 491 TFNTLISAYSRCGSFEQAMTVYRRMLDAGV 520
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
280-313 7.04e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.52  E-value: 7.04e-05
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063729576 280 YTYNTLITCCKRGSLHQEAAQVFEEMKAAGFSYD 313
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
735-768 7.47e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.52  E-value: 7.47e-05
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063729576 735 ITYNTFIGSYAADSMFEEAIGVVRYMIKHGCRPN 768
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
526-559 8.40e-05

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 40.13  E-value: 8.40e-05
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063729576 526 TYNTVLAALARGGMWEQSEKVLAEMEDGRCKPNE 559
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
455-485 9.33e-05

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 40.14  E-value: 9.33e-05
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063729576 455 VTWNTLLAVFGQNGMDSEVSGVFKEMKRAGF 485
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
729-760 1.41e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 39.64  E-value: 1.41e-04
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1063729576 729 GIVPDVITYNTFIGSYAADSMFEEAIGVVRYM 760
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
455-487 1.58e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 39.36  E-value: 1.58e-04
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1063729576 455 VTWNTLLAVFGQNGMDSEVSGVFKEMKRAGFVP 487
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEP 33
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
309-341 2.42e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 38.87  E-value: 2.42e-04
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1063729576 309 GFSYDKVTYNALLDVYGKSHRPKEAMKVLNEMV 341
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
666-699 5.13e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 38.21  E-value: 5.13e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 1063729576 666 TYNSLMYMHSRSADFGKSEEILREILAKGIKPDI 699
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
420-454 9.34e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 37.43  E-value: 9.34e-04
                          10        20        30
                  ....*....|....*....|....*....|....*
gi 1063729576 420 CTFNAFIKMYGNRGKFTEMMKIFDEINVCGLSPDI 454
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEPDV 35
LapB COG2956
Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal ...
290-516 9.64e-04

Lipopolysaccharide biosynthesis regulator YciM/LapB, contains six TPR domains and a C-terminal metal-binding domain [Cell wall/membrane/envelope biogenesis];


Pssm-ID: 442196 [Multi-domain]  Cd Length: 275  Bit Score: 42.02  E-value: 9.64e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 290 KRGsLHQEAAQVFEEMKAAGfSYDKVTYNALLDVYGKSHRPKEAMKVLNEMV-LNGFSPSIvtYNSLISAYARDGMLDEA 368
Cdd:COG2956    54 RRG-EYDRAIRIHQKLLERD-PDRAEALLELAQDYLKAGLLDRAEELLEKLLeLDPDDAEA--LRLLAEIYEQEGDWEKA 129
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 369 MELKNQMAEKGTKPDVFtYTTLLSGFERAGKVESAMSIFEEMRNAgcKPNicTFNAFI---KMYGNRGKFTEMMKIFDEI 445
Cdd:COG2956   130 IEVLERLLKLGPENAHA-YCELAELYLEQGDYDEAIEALEKALKL--DPD--CARALLllaELYLEQGDYEEAIAALERA 204
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 1063729576 446 nvCGLSPDIVTWNTLLA-VFGQNGMDSEVSGVFKEMKRAGfvPERETFNTLISAYSRCGSFEQAMTVYRRML 516
Cdd:COG2956   205 --LEQDPDYLPALPRLAeLYEKLGDPEEALELLRKALELD--PSDDLLLALADLLERKEGLEAALALLERQL 272
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
280-310 1.21e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.06  E-value: 1.21e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063729576 280 YTYNTLITCCKRGSLHQEAAQVFEEMKAAGF 310
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
735-765 2.53e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 35.90  E-value: 2.53e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063729576 735 ITYNTFIGSYAADSMFEEAIGVVRYMIKHGC 765
Cdd:pfam01535   1 VTYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
476-607 2.59e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression primarily in organelles but also in the nucleus. PPR_long is the region of Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in Eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs) which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superheros with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The protein-only mitochondrial RNase P crystal structure from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggest it has evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 40.07  E-value: 2.59e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 1063729576 476 VFKEMKRAGFVPERETFNTLI----SAYSRCGSFEQ-----AMTVYRRMLDAGVTPDLSTYNTV--LAALARGGmwEQSE 544
Cdd:pfam17177  33 LYDAAKAEGVRLAQYHYNVLLylcsKAADATDLKPQlaadrGFEVFEAMKAQGVSPNEATYTAVarLAAAKGDG--DLAF 110
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 1063729576 545 KVLAEMEDGRCKPNELTYCSLLHAYANGKEIGLMHSLAEEVYSGVIEPRAVLLKTLVLVCSKC 607
Cdd:pfam17177 111 DLVKEMEAAGVSPRLRSYSPALHAYCEAGDADKAYEVEEHMLAHGVELEEPELAALLKVSAKA 173
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
484-515 3.12e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 35.78  E-value: 3.12e-03
                          10        20        30
                  ....*....|....*....|....*....|..
gi 1063729576 484 GFVPERETFNTLISAYSRCGSFEQAMTVYRRM 515
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEM 33
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
519-551 3.88e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 35.40  E-value: 3.88e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1063729576 519 GVTPDLSTYNTVLAALARGGMWEQSEKVLAEME 551
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
631-663 6.24e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.12  E-value: 6.24e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 1063729576 631 TLNSMVSIYGRRQMVAKANGVLDYMKERGFTPS 663
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
414-444 6.28e-03

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 35.01  E-value: 6.28e-03
                          10        20        30
                  ....*....|....*....|....*....|.
gi 1063729576 414 GCKPNICTFNAFIKMYGNRGKFTEMMKIFDE 444
Cdd:pfam12854   2 GLKPDVVTYNTLINGLCRAGRVDEAFELLDE 32
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH