Conserved domains on [gi|15238810|ref|NP_197340|]

Pentatricopeptide repeat (PPR) superfamily protein [Arabidopsis thaliana]

Protein Classification

pentatricopeptide repeat-containing protein (domain architecture ID 1000098)

Pentatricopeptide repeat (PPR)-containing proteins may form antiparallel alpha helices and bind single-stranded RNA in a sequence-specific, modular manner.

List of domain hits

Name Accession Description Interval E-value
PLN03218 super family cl33664
maturation of RBCL 1; Provisional
181-425 7.02e-16

maturation of RBCL 1; Provisional


The actual alignment was detected with superfamily member PLN03218:

Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 80.31  E-value: 7.02e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   181 TVDVYNSLLHALCDVKMFHGAYALIRRMIRKGLKPDKRTYAILVNGWCSAGKMKEAQEFLDEMSRRGFNPPARGRDLLIE 260
Cdd:PLN03218  436 TLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALID 515
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   261 GLLNAGYLESA----KEMVSKMTKggfvPDIQTFNILIEAISKSGEVE--FCI--EMYYTACKlglcVDID--TYKTLIP 330
Cdd:PLN03218  516 GCARAGQVAKAfgayGIMRSKNVK----PDRVVFNALISACGQSGAVDraFDVlaEMKAETHP----IDPDhiTVGALMK 587
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   331 AVSKIGKIDEAFRLLNNCVEDGHKPFPSLYAPIIKGMCRNGMFDDAFSFFSDMKVKAHPPNRPVYTMLITMCGRGGKFVD 410
Cdd:PLN03218  588 ACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDK 667
                         250
                  ....*....|....*
gi 15238810   411 AANYLVEMTEMGLVP 425
Cdd:PLN03218  668 AFEILQDARKQGIKL 682
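
Each hit on this page carries a one-line summary (Pssm-ID, model type, Cd Length, Bit Score, E-value) just above its alignment. As a minimal sketch, assuming the line keeps exactly the layout shown here, it can be pulled apart with a regular expression. The names `SUMMARY_RE` and `parse_summary` are hypothetical helpers for illustration and are not part of any NCBI tool.

```python
import re

# Pattern for the per-hit summary line, assuming the layout shown on this page.
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)\s*"
    r"\[(?P<kind>[^\]]+)\]\s*"
    r"Cd Length:\s*(?P<cd_length>\d+)\s*"
    r"Bit Score:\s*(?P<bit_score>[\d.]+)\s*"
    r"E-value:\s*(?P<e_value>\S+)"
)


def parse_summary(line):
    """Return the summary fields as a dict, or None if the line does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "kind": m["kind"],
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


# Summary line copied from the PLN03218 hit above.
example = "Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 80.31  E-value: 7.02e-16"
hit = parse_summary(example)
print(hit)
```

Returning `None` for non-matching lines lets the same function be run over every line of a saved results page without special-casing the alignment rows.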
 
Name Accession Description Interval E-value
PLN03218 PLN03218
maturation of RBCL 1; Provisional
181-425 7.02e-16

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 80.31  E-value: 7.02e-16
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   181 TVDVYNSLLHALCDVKMFHGAYALIRRMIRKGLKPDKRTYAILVNGWCSAGKMKEAQEFLDEMSRRGFNPPARGRDLLIE 260
Cdd:PLN03218  436 TLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALID 515
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   261 GLLNAGYLESA----KEMVSKMTKggfvPDIQTFNILIEAISKSGEVE--FCI--EMYYTACKlglcVDID--TYKTLIP 330
Cdd:PLN03218  516 GCARAGQVAKAfgayGIMRSKNVK----PDRVVFNALISACGQSGAVDraFDVlaEMKAETHP----IDPDhiTVGALMK 587
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   331 AVSKIGKIDEAFRLLNNCVEDGHKPFPSLYAPIIKGMCRNGMFDDAFSFFSDMKVKAHPPNRPVYTMLITMCGRGGKFVD 410
Cdd:PLN03218  588 ACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDK 667
                         250
                  ....*....|....*
gi 15238810   411 AANYLVEMTEMGLVP 425
Cdd:PLN03218  668 AFEILQDARKQGIKL 682
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
211-244 1.22e-10

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 56.20  E-value: 1.22e-10
                          10        20        30
                  ....*....|....*....|....*....|....
gi 15238810   211 KGLKPDKRTYAILVNGWCSAGKMKEAQEFLDEMS 244
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
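
The query and model rows of an alignment like the one above can be compared column by column to count identical residues. The sketch below is illustrative only: it assumes that `-` marks a gap and that lowercase letters mark unaligned residues, and it skips both. `count_identities` is a hypothetical helper, not an NCBI function.

```python
def count_identities(query_row, model_row):
    """Return (identical, aligned) over columns where both rows hold an aligned residue."""
    if len(query_row) != len(model_row):
        raise ValueError("alignment rows must have equal length")
    identical = aligned = 0
    for q, m in zip(query_row, model_row):
        # Assumption: '-' is a gap and lowercase letters are unaligned residues.
        if not (q.isupper() and m.isupper()):
            continue
        aligned += 1
        if q == m:
            identical += 1
    return identical, aligned


# Rows copied from the pfam12854 (PPR_1) hit at query residues 211-244.
query = "KGLKPDKRTYAILVNGWCSAGKMKEAQEFLDEMS"
model = "KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME"
identical, aligned = count_identities(query, model)
print(f"{identical}/{aligned} identical ({100 * identical / aligned:.1f}%)")
```

Percent identity computed this way is only a rough descriptor; the bit score and E-value reported by the search remain the measures of significance.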
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
184-216 9.10e-06

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 42.44  E-value: 9.10e-06
                          10        20        30
                  ....*....|....*....|....*....|...
gi 15238810   184 VYNSLLHALCDVKMFHGAYALIRRMIRKGLKPD 216
Cdd:TIGR00756   2 TYNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
 
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
136-407 2.80e-15

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 78.37  E-value: 2.80e-15
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  136 QMKDLSLDISGETLCFIIEQYGKNGHVDQAVELFNGVpkTLGCQQTVDV--YNSLLHALCDVKMFHGAYALIRRMIRKGL 213
Cdd:PLN03081  77 RLDDTQIRKSGVSLCSQIEKLVACGRHREALELFEIL--EAGCPFTLPAstYDALVEACIALKSIRCVKAVYWHVESSGF 154
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  214 KPDKRTYAILVNGWCSAGKMKEAQEFLDEMsrrgfnpPARGR---DLLIEGLLNAGYLESAKEMVSKMTKGGFVPDIQTF 290
Cdd:PLN03081 155 EPDQYMMNRVLLMHVKCGMLIDARRLFDEM-------PERNLaswGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTF 227
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  291 NILIEAISKSGEVEFCIEMYYTACKLGLCVDIDTYKTLIPAVSKIGKIDEAfrllnNCVEDGhKPFPSLYA--PIIKGMC 368
Cdd:PLN03081 228 VVMLRASAGLGSARAGQQLHCCVLKTGVVGDTFVSCALIDMYSKCGDIEDA-----RCVFDG-MPEKTTVAwnSMLAGYA 301
                        250       260       270
                 ....*....|....*....|....*....|....*....
gi 15238810  369 RNGMFDDAFSFFSDMKVKAHPPNRPVYTMLITMCGRGGK 407
Cdd:PLN03081 302 LHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLAL 340
PLN03218 PLN03218
maturation of RBCL 1; Provisional
210-419 1.99e-14

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 76.07  E-value: 1.99e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   210 RKGLKP--DKRTYAILVNGWCSAGKMKEAQEFLDEMSRRGFnppargrdLLIEGLLNAGYLESAK-----EMVSKMTKGG 282
Cdd:PLN03218  361 NGGVSGkrKSPEYIDAYNRLLRDGRIKDCIDLLEDMEKRGL--------LDMDKIYHAKFFKACKkqravKEAFRFAKLI 432
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   283 FVPDIQTFNILIEAISKSGEVEFCIEMYYTACKLGLCVDIDTYKTLIPAVSKIGKIDEAFRLLNNCVEDGHKPFPSLYAP 362
Cdd:PLN03218  433 RNPTLSTFNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGA 512
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*..
gi 15238810   363 IIKGMCRNGMFDDAFSFFSDMKVKAHPPNRPVYTMLITMCGRGGKFVDAANYLVEMT 419
Cdd:PLN03218  513 LIDGCARAGQVAKAFGAYGIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMK 569
PLN03218 PLN03218
maturation of RBCL 1; Provisional
152-425 8.85e-14

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 73.76  E-value: 8.85e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   152 IIEQYGKNGHVDQAVELFNGVPKTlGCQQTVDVYNSLLHAlCD-----VKMFhGAYALirrMIRKGLKPDKRTYAILVNG 226
Cdd:PLN03218  478 LISTCAKSGKVDAMFEVFHEMVNA-GVEANVHTFGALIDG-CAragqvAKAF-GAYGI---MRSKNVKPDRVVFNALISA 551
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   227 WCSAGKMKEAQEFLDEMSRRG--FNPPARGRDLLIEGLLNAGYLESAKEMVSKMTKGGFVPDIQTFNILIEAISKSGEVE 304
Cdd:PLN03218  552 CGQSGAVDRAFDVLAEMKAEThpIDPDHITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWD 631
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   305 FCIEMYYTACKLGLCVDIDTYKTLIPAVSKIGKIDEAFRLLNNCVEDGHKPFPSLYAPIIkGMCRN-GMFDDAFSFFSDM 383
Cdd:PLN03218  632 FALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLM-GACSNaKNWKKALELYEDI 710
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|..
gi 15238810   384 KVKAHPPNRPVYTMLITMCGRGGKFVDAANYLVEMTEMGLVP 425
Cdd:PLN03218  711 KSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCP 752
PLN03218 PLN03218
maturation of RBCL 1; Provisional
86-429 2.49e-12

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 69.14  E-value: 2.49e-12
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810    86 ATSRSSNDSLRFFNWARSNpSYTPTSMEYEELAKSLASHKKYESMWKILKQMKDLSLDISGETLCFIIEQYGKNGHVDQA 165
Cdd:PLN03218  448 ASSQDIDGALRVLRLVQEA-GLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKA 526
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   166 VELFnGVPKTLGCQQTVDVYNSLLHALCDVKMFHGAYALIRRMI--RKGLKPDKRTYAILVNGWCSAGKMKEAQEFLDEM 243
Cdd:PLN03218  527 FGAY-GIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEMKaeTHPIDPDHITVGALMKACANAGQVDRAKEVYQMI 605
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   244 SRRGFnppaRGRDLLIEGLLNA----GYLESAKEMVSKMTKGGFVPDIQTFNILIEAISKSGEVEFCIEMYYTACKLGLC 319
Cdd:PLN03218  606 HEYNI----KGTPEVYTIAVNScsqkGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAFEILQDARKQGIK 681
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   320 VDIDTYKTLIPAVSKIGKIDEAFRLLNNCVEDGHKPFPSLYAPIIKGMCRNGMFDDAFSFFSDMKVKAHPPNRPVYTMLI 399
Cdd:PLN03218  682 LGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNTITYSILL 761
                         330       340       350
                  ....*....|....*....|....*....|...
gi 15238810   400 TMCGRGGKFVDAANYLVEMTEMGLVP---ISRC 429
Cdd:PLN03218  762 VASERKDDADVGLDLLSQAKEDGIKPnlvMCRC 794
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
150-402 2.04e-11

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 66.05  E-value: 2.04e-11
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  150 CFIIEQYGKNGHVDQAVELFNGVPktlgcQQTVDVYNSLLHALCdvkmFHG----AYALIRRMIRKGLKPDKRTYAILVN 225
Cdd:PLN03081 263 CALIDMYSKCGDIEDARCVFDGMP-----EKTTVAWNSMLAGYA----LHGyseeALCLYYEMRDSGVSIDQFTFSIMIR 333
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  226 GWCSAGKMKEAQEFLDEMSRRGFNPPARGRDLLIEGLLNAGYLESAKEMVSKMTKggfvPDIQTFNILIEAISKSGEVEF 305
Cdd:PLN03081 334 IFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPR----KNLISWNALIAGYGNHGRGTK 409
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  306 CIEMYYTACKLGLCVDIDTYKTLIPAVSKIGKIDEAFRLLNNCVED-GHKPFPSLYAPIIKGMCRNGMFDDAFSFFSDMK 384
Cdd:PLN03081 410 AVEMFERMIAEGVAPNHVTFLAVLSACRYSGLSEQGWEIFQSMSENhRIKPRAMHYACMIELLGREGLLDEAYAMIRRAP 489
                        250
                 ....*....|....*...
gi 15238810  385 VKahpPNRPVYTMLITMC 402
Cdd:PLN03081 490 FK---PTVNMWAALLTAC 504
PLN03218 PLN03218
maturation of RBCL 1; Provisional
158-404 3.80e-11

maturation of RBCL 1; Provisional


Pssm-ID: 215636 [Multi-domain]  Cd Length: 1060  Bit Score: 65.67  E-value: 3.80e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   158 KNGHVDQAVELFNGVPKTlGCQQTVDVYNSLLHALCDVKMFHGAYALIRRMIRKGLKPDKRTYAILVNGWCSAGKMKEAQ 237
Cdd:PLN03218  591 NAGQVDRAKEVYQMIHEY-NIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMKKKGVKPDEVFFSALVDVAGHAGDLDKAF 669
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   238 EFLDEMSRRGFNPPARGRDLLIEGLLNAGYLESAKEMVSKMTKGGFVPDIQTFNILIEAISKSGEVEFCIEMYYTACKLG 317
Cdd:PLN03218  670 EILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPTVSTMNALITALCEGNQLPKALEVLSEMKRLG 749
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   318 LCVDIDTYKTLIPAVSkigkideafrllnncvedghkpfpslyapiikgmcRNGMFDDAFSFFSDMKVKAHPPNRPVYTM 397
Cdd:PLN03218  750 LCPNTITYSILLVASE-----------------------------------RKDDADVGLDLLSQAKEDGIKPNLVMCRC 794

                  ....*..
gi 15238810   398 LITMCGR 404
Cdd:PLN03218  795 ITGLCLR 801
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
93-406 1.04e-09

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 60.65  E-value: 1.04e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   93 DSLRFFNWARSNPSYTPTSMEYEELAKSLASHKKYESMWKILKQMKDLSLDISGETLCFIIEQYGKNGHVDQAVELFNGV 172
Cdd:PLN03081 105 EALELFEILEAGCPFTLPASTYDALVEACIALKSIRCVKAVYWHVESSGFEPDQYMMNRVLLMHVKCGMLIDARRLFDEM 184
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  173 PktlgcQQTVDVYNSLLHALCDVKMFHGAYALIRRMIRKGLKPDKRTYAI------------------------------ 222
Cdd:PLN03081 185 P-----ERNLASWGTIIGGLVDAGNYREAFALFREMWEDGSDAEPRTFVVmlrasaglgsaragqqlhccvlktgvvgdt 259
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  223 -----LVNGWCSAGKMKEAQEFLDEMSRR---GFNPpargrdlLIEGLLNAGYLESAKEMVSKMTKGGFVPDIQTFNILI 294
Cdd:PLN03081 260 fvscaLIDMYSKCGDIEDARCVFDGMPEKttvAWNS-------MLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMI 332
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  295 EAISKSGEVEFCIEMYYTACKLGLCVDIDTYKTLIPAVSKIGKIDEAFRLLNNCvedGHKPFPSLYApIIKGMCRNGMFD 374
Cdd:PLN03081 333 RIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRM---PRKNLISWNA-LIAGYGNHGRGT 408
                        330       340       350
                 ....*....|....*....|....*....|..
gi 15238810  375 DAFSFFSDMKVKAHPPNRPVYTMLITMCGRGG 406
Cdd:PLN03081 409 KAVEMFERMIAEGVAPNHVTFLAVLSACRYSG 440
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
181-228 2.93e-09

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 52.75  E-value: 2.93e-09
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*...
gi 15238810   181 TVDVYNSLLHALCDVKMFHGAYALIRRMIRKGLKPDKRTYAILVNGWC 228
Cdd:pfam13041   2 DVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLC 49
PLN03077 PLN03077
Protein ECB2; Provisional
152-418 3.40e-09

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 59.09  E-value: 3.40e-09
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  152 IIEQYGKNGHVDQAVELFNGVPktlgcQQTVDVYNSLLHALC-DVKMFHGAYaLIRRMIRKgLKPDKRTY-----AILVN 225
Cdd:PLN03077 430 LIEMYSKCKCIDKALEVFHNIP-----EKDVISWTSIIAGLRlNNRCFEALI-FFRQMLLT-LKPNSVTLiaalsACARI 502
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  226 GWCSAGKMKEAQEFLDEMSRRGFNPPArgrdlLIEGLLNAGYLESAKEMVSKMTKggfvpDIQTFNILIEAISKSGEVEF 305
Cdd:PLN03077 503 GALMCGKEIHAHVLRTGIGFDGFLPNA-----LLDLYVRCGRMNYAWNQFNSHEK-----DVVSWNILLTGYVAHGKGSM 572
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  306 CIEMYYTACKLGLCVDIDTYKTLIPAVSKIGKIDEAFRLLNNcVEDGHKPFPSL--YAPIIKGMCRNGMFDDAFSFFSDM 383
Cdd:PLN03077 573 AVELFNRMVESGVNPDEVTFISLLCACSRSGMVTQGLEYFHS-MEEKYSITPNLkhYACVVDLLGRAGKLTEAYNFINKM 651
                        250       260       270
                 ....*....|....*....|....*....|....*....
gi 15238810  384 KVKahpPNRPVYTMLITMCgRGGKFVD----AANYLVEM 418
Cdd:PLN03077 652 PIT---PDPAVWGALLNAC-RIHRHVElgelAAQHIFEL 686
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
363-404 1.80e-08

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 50.44  E-value: 1.80e-08
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 15238810   363 IIKGMCRNGMFDDAFSFFSDMKVKAHPPNRPVYTMLITMCGR 404
Cdd:pfam13041   9 LINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
258-299 4.02e-06

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 43.51  E-value: 4.02e-06
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|..
gi 15238810   258 LIEGLLNAGYLESAKEMVSKMTKGGFVPDIQTFNILIEAISK 299
Cdd:pfam13041   9 LINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
215-250 1.05e-04

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 39.65  E-value: 1.05e-04
                          10        20        30
                  ....*....|....*....|....*....|....*.
gi 15238810   215 PDKRTYAILVNGWCSAGKMKEAQEFLDEMSRRGFNP 250
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKP 36
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
240-299 1.55e-04

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in the N-terminal region of protein-only RNase P (PRORP) enzymes form the pentatricopeptide repeat (PPR) domain, which enhances pre-tRNA binding affinity. PRORP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 39.65  E-value: 1.55e-04
                          10        20        30        40        50        60
                  ....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810   240 LDEMSRRGFNPPARGRDLLIEGLLNAGYLESAKEMVSKMTKGGFVPDIQTFNILIEAISK 299
Cdd:pfam13812   3 LREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKPTLDTYNAILGVIGG 62
PPR_1 pfam12854
PPR repeat; This family matches additional variants of the PPR repeat that were not captured ...
351-384 1.73e-04

PPR repeat; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. The exact function is not known.


Pssm-ID: 403914 [Multi-domain]  Cd Length: 34  Bit Score: 38.48  E-value: 1.73e-04
                          10        20        30
                  ....*....|....*....|....*....|....
gi 15238810   351 DGHKPFPSLYAPIIKGMCRNGMFDDAFSFFSDMK 384
Cdd:pfam12854   1 KGLKPDVVTYNTLINGLCRAGRVDEAFELLDEME 34
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
360-391 2.19e-04

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 38.59  E-value: 2.19e-04
                          10        20        30
                  ....*....|....*....|....*....|..
gi 15238810   360 YAPIIKGMCRNGMFDDAFSFFSDMKVKAHPPN 391
Cdd:TIGR00756   3 YNTLIDGLCKAGRVEEALELFKEMKERGIEPD 34
PLN03077 PLN03077
Protein ECB2; Provisional
156-439 4.39e-04

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 42.91  E-value: 4.39e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  156 YGKNGHVDQAVELFNgvpKTLGCQQTVDVYNsllhALCDVKMFHGAYALIR------RMIRKGLKPDKRTYAILVNGWCS 229
Cdd:PLN03077 162 YAKAGYFDEALCLYH---RMLWAGVRPDVYT----FPCVLRTCGGIPDLARgrevhaHVVRFGFELDVDVVNALITMYVK 234
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  230 AGKMKEAQEFLDEMSRRgfnppargrDLLIEGLLNAGYLESAK-----EMVSKMTKGGFVPDIQTFNILIEAISKSGEVE 304
Cdd:PLN03077 235 CGDVVSARLVFDRMPRR---------DCISWNAMISGYFENGEcleglELFFTMRELSVDPDLMTITSVISACELLGDER 305
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  305 FCIEMYYTACKLGLCVDIDTYKTLIPAVSKIGKIDEAFRLLNNCvedGHKPFPSlYAPIIKGMCRNGMFDDAFSFFSDMK 384
Cdd:PLN03077 306 LGREMHGYVVKTGFAVDVSVCNSLIQMYLSLGSWGEAEKVFSRM---ETKDAVS-WTAMISGYEKNGLPDKALETYALME 381
                        250       260       270       280       290       300       310
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 15238810  385 VKAHPPNRPVYTMLITMCGRGGK------------------FVDAANYLVEMTEMglvpiSRCFDMVTDGLKN 439
Cdd:PLN03077 382 QDNVSPDEITIASVLSACACLGDldvgvklhelaerkglisYVVVANALIEMYSK-----CKCIDKALEVFHN 449
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
219-248 6.39e-04

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 37.06  E-value: 6.39e-04
                          10        20        30
                  ....*....|....*....|....*....|
gi 15238810   219 TYAILVNGWCSAGKMKEAQEFLDEMSRRGF 248
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PRK08173 PRK08173
DNA topoisomerase III; Validated
214-275 1.06e-03

DNA topoisomerase III; Validated


Pssm-ID: 236172 [Multi-domain]  Cd Length: 862  Bit Score: 41.58  E-value: 1.06e-03
                         10        20        30        40        50        60
                 ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 15238810  214 KPDKR-TYAILVNGWCSAGKMKEAQEFLDEMSRRGFNPPARgRDLLIEGLLNAGYLE-SAKEMV 275
Cdd:PRK08173 485 KPPARyNEATLLSAMEGAGKLVEDDELREAMAEKGLGTPAT-RAAIIEGLLGEKYLVrEGRELI 547
PLN03081 PLN03081
pentatricopeptide (PPR) repeat-containing protein; Provisional
152-247 1.45e-03

pentatricopeptide (PPR) repeat-containing protein; Provisional


Pssm-ID: 215563 [Multi-domain]  Cd Length: 697  Bit Score: 41.01  E-value: 1.45e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  152 IIEQYGKNGHVDQAVELFNGVPKTlgcqQTVDVYNSLLHAlCDVkmfHGAYALIRRMIRK--GLKPDK-RTYAILVNGWC 228
Cdd:PLN03081 468 MIELLGREGLLDEAYAMIRRAPFK----PTVNMWAALLTA-CRI---HKNLELGRLAAEKlyGMGPEKlNNYVVLLNLYN 539
                         90
                 ....*....|....*....
gi 15238810  229 SAGKMKEAQEFLDEMSRRG 247
Cdd:PLN03081 540 SSGRQAEAAKVVETLKRKG 558
PLN03077 PLN03077
Protein ECB2; Provisional
159-403 2.66e-03

Protein ECB2; Provisional


Pssm-ID: 215561 [Multi-domain]  Cd Length: 857  Bit Score: 40.22  E-value: 2.66e-03
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  159 NGHVDQAVELFNGVPKtLGCQQTVDVYNSLLHaLCDVKMFH--GAYA---LIRRMIRKGLKPDKRTYAILV------NGW 227
Cdd:PLN03077  64 HGQLEQALKLLESMQE-LRVPVDEDAYVALFR-LCEWKRAVeeGSRVcsrALSSHPSLGVRLGNAMLSMFVrfgelvHAW 141
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  228 CSAGKMKEAQEFldemsrrgfnpparGRDLLIEGLLNAGYLESAKEMVSKMTKGGFVPDIQTFNILIEAISKSGEVEFCI 307
Cdd:PLN03077 142 YVFGKMPERDLF--------------SWNVLVGGYAKAGYFDEALCLYHRMLWAGVRPDVYTFPCVLRTCGGIPDLARGR 207
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 15238810  308 EMYYTACKLGLCVDIDTYKTLIPAVSKIGKIDEAFRLLN-----NCVEdghkpfpslYAPIIKGMCRNGMFDDAFSFFSD 382
Cdd:PLN03077 208 EVHAHVVRFGFELDVDVVNALITMYVKCGDVVSARLVFDrmprrDCIS---------WNAMISGYFENGECLEGLELFFT 278
                        250       260
                 ....*....|....*....|.
gi 15238810  383 MKVKAHPPNRPVYTMLITMCG 403
Cdd:PLN03077 279 MRELSVDPDLMTITSVISACE 299
PPR TIGR00756
pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR ...
218-250 2.75e-03

pentatricopeptide repeat domain (PPR motif); This model describes a domain called the PPR motif, or pentatricopeptide repeat. Its consensus sequence is 35 positions long and typically is found in four or more tandem copies. This family is strongly represented in plant proteins, particularly those sorted to chloroplasts or mitochondria. The pfam01535, domain of unknown function DUF17, consists of 6 copies of this repeat. This family has a similar consensus to the TPR domain (tetratricopeptide), pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.


Pssm-ID: 273253 [Multi-domain]  Cd Length: 35  Bit Score: 35.12  E-value: 2.75e-03
                          10        20        30
                  ....*....|....*....|....*....|...
gi 15238810   218 RTYAILVNGWCSAGKMKEAQEFLDEMSRRGFNP 250
Cdd:TIGR00756   1 VTYNTLIDGLCKAGRVEEALELFKEMKERGIEP 33
PPR_long pfam17177
Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large ...
374-431 5.12e-03

Pentacotripeptide-repeat region of PRORP; Pentatricopeptide repeat (PPR) proteins are a large family of modular RNA-binding proteins which mediate several aspects of gene expression, primarily in organelles but also in the nucleus. PPR_long is the region of the Arabidopsis protein-only RNase P (PRORP) enzyme that consists of up to eleven alpha-helices. PRORPs are a class of RNA processing enzymes that catalyze maturation of the 5' end of precursor tRNAs in eukaryotes. All PPR proteins contain tandemly repeated sequence motifs (the PPR motifs), which can vary in number. The series of helix-turn-helix motifs formed by PPR motifs throughout the protein produces a superhelix with a central groove that allows the protein to bind RNA. Proteins containing PPR motifs are known to have roles in transcription, RNA processing, splicing, stability, editing, and translation. Over a decade after the discovery of PPR proteins, the super-helical structure was confirmed. The crystal structure of the protein-only mitochondrial RNase P from Arabidopsis thaliana (PRORP1) confirmed the role of its PPR motifs in pre-tRNA binding and suggests that it evolved independently from other RNase P proteins that rely on catalytic RNA.


Pssm-ID: 407303 [Multi-domain]  Cd Length: 212  Bit Score: 38.15  E-value: 5.12e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 15238810   374 DDAFSFFSDMKVKAHPPNRPVYTMLITMCGRGGKFvDAANYLV-EMTEMGLVPISRCFD 431
Cdd:pfam17177  72 DRGFEVFEAMKAQGVSPNEATYTAVARLAAAKGDG-DLAFDLVkEMEAAGVSPRLRSYS 129
PPR_2 pfam13041
PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is ...
285-334 8.24e-03

PPR repeat family; This repeat has no known function. It is about 35 amino acids long and is found in up to 18 copies in some proteins. The family appears to be greatly expanded in plants and fungi. The repeat has been called PPR.


Pssm-ID: 463778 [Multi-domain]  Cd Length: 50  Bit Score: 34.26  E-value: 8.24e-03
                          10        20        30        40        50
                  ....*....|....*....|....*....|....*....|....*....|
gi 15238810   285 PDIQTFNILIEAISKSGEVEFCIEMYYTACKLGLCVDIDTYKTLIPAVSK 334
Cdd:pfam13041   1 PDVVTYNTLINGYCKKGKVEEAFKLFNEMKKRGVKPNVYTYTILINGLCK 50
PPR pfam01535
PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up ...
184-213 8.32e-03

PPR repeat; This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. This repeat occurs in PET309 that may be involved in RNA stabilization. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein Swiss:O49549 that has a domain organization similar to the human BRCA1 protein. The repeat has been called PPR.


Pssm-ID: 366695 [Multi-domain]  Cd Length: 31  Bit Score: 33.98  E-value: 8.32e-03
                          10        20        30
                  ....*....|....*....|....*....|
gi 15238810   184 VYNSLLHALCDVKMFHGAYALIRRMIRKGL 213
Cdd:pfam01535   2 TYNSLISGYCKNGKLEEALELFKEMKEKGI 31
PPR_3 pfam13812
Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat ...
204-250 9.66e-03

Pentatricopeptide repeat domain; This family matches additional variants of the PPR repeat that were not captured by the model for pfam01535. In the Arabidopsis protein UniProtKB:Q66GI4, the repeated helices in the N-terminal region of protein-only RNase P (PRORP) enzymes form the pentatricopeptide repeat (PPR) domain, which enhances pre-tRNA binding affinity. PRORP enzymes process precursor tRNAs in human mitochondria and in all tRNA-using compartments of Arabidopsis thaliana.


Pssm-ID: 316342 [Multi-domain]  Cd Length: 63  Bit Score: 34.64  E-value: 9.66e-03
                          10        20        30        40
                  ....*....|....*....|....*....|....*....|....*..
gi 15238810   204 LIRRMIRKGLKPDKRTYAILVNGWCSAGKMKEAQEFLDEMSRRGFNP 250
Cdd:pfam13812   2 ILREMVRDGIQLNVNTYTHLLHAYANVGNLKLALEIFERMKKKGIKP 48
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options: Database: CDSEARCH/cdd; Low complexity filter: no; Composition Based Adjustment: yes; E-value threshold: 0.01
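
The hit summary lines on this page ("Pssm-ID: ... Cd Length: ... Bit Score: ... E-value: ...") can be read programmatically and compared with the E-value threshold listed above. The following is a minimal Python sketch, not an NCBI tool: the names `parse_summary` and `passes_threshold` are hypothetical, the regular expression assumes the line layout shown on this page, and treating the threshold as inclusive is an assumption.

```python
import re

# Hypothetical pattern for one hit summary line as it appears on this page,
# e.g. "Pssm-ID: 236172 [Multi-domain]  Cd Length: 862  Bit Score: 41.58  E-value: 1.06e-03"
SUMMARY_RE = re.compile(
    r"Pssm-ID:\s*(?P<pssm_id>\d+)"
    r".*?Cd Length:\s*(?P<cd_length>\d+)"
    r"\s+Bit Score:\s*(?P<bit_score>[\d.]+)"
    r"\s+E-value:\s*(?P<e_value>[\d.eE+-]+)"
)


def parse_summary(line):
    """Return the numeric fields of a hit summary line, or None if it does not match."""
    m = SUMMARY_RE.search(line)
    if m is None:
        return None
    return {
        "pssm_id": int(m["pssm_id"]),
        "cd_length": int(m["cd_length"]),
        "bit_score": float(m["bit_score"]),
        "e_value": float(m["e_value"]),
    }


def passes_threshold(hit, threshold=0.01):
    """Assumption: a hit is kept when its E-value is at or below the threshold."""
    return hit["e_value"] <= threshold


if __name__ == "__main__":
    line = "Pssm-ID: 236172 [Multi-domain]  Cd Length: 862  Bit Score: 41.58  E-value: 1.06e-03"
    hit = parse_summary(line)
    print(hit, passes_threshold(hit))
```

Returning `None` for non-matching lines lets the parser be run over every line of the page without first separating summary lines from alignment rows.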

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res. 51(D):384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res. 48(D):265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures", Nucleic Acids Res. 45(D):200-3.