NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|115583687|ref|NP_619600|]
View 

pre-mRNA-processing-splicing factor 8 [Mus musculus]

Protein Classification

PRP8 family protein( domain architecture ID 11474092)

PRP8 family protein similar to Homo sapiens pre-mRNA-processing-splicing factor 8 that plays role in pre-mRNA splicing as core component of precatalytic, catalytic and postcatalytic spliceosomal complexes, both of the predominant U2-type spliceosome and the minor U12-type spliceosome

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
PRP8 COG5178
U5 snRNP spliceosome subunit [RNA processing and modification];
1-2335 0e+00

U5 snRNP spliceosome subunit [RNA processing and modification];


:

Pssm-ID: 227505 [Multi-domain]  Cd Length: 2365  Bit Score: 3576.09  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687    1 MAGVFP-YRGPGNPVPG-------PLAPLPDYMSEEK--------------LQEKARKWQQLQ-AKRYAEKRKFGFVDAQ 57
Cdd:COG5178     1 MASLPPgNPPPPPPPPGfeppsqpPPPPPPGVNVKKRsrkqlsivgdilghSGNPIYSLRVSDkPVKLGNKAKTLHVLTL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687   58 KEDMPPEHVRKIIRDHGDMTNRKFRHDKRVYLGALKYMPHAVLKLLENMPMPWEQIRDVPVLYHITGAISFVNEIPWVIE 137
Cdd:COG5178    81 KAPIPPEHLRKIQSPCSDMPSVLTKVDKRSYLGALKYLPHAVLKLLENMPSPWEDVSEVKVLYHCHGAITFVNEVPRVIE 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  138 PVYISQWGSMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNILDVEPLEAIQLELDPEEDAPVLDWFYDHQPLRDSR 217
Cdd:COG5178   161 PQLFAQWGLCWSPMRREKRDRYSFKRKRFPPFDDLEPPLSKSQWVLGVEPLMPINIRLDRMDDEHVRDWVYTSRDLEDHP 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  218 KyVNGSTYQRWQFTLPMMSTLYRLANQLLTDLVDDNYFYLFDLKAFFTSKALNMAIPGGPKFEPLVRDINLQDEDWNEFN 297
Cdd:COG5178   241 S-VNGAMYRRWKYMLPAMHNLLRLMPMLWESIRDVNYVYLFSGLSFFVAKALNVAIPGGPKFEPLYSRESAEFEDENEFN 319
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  298 DINKIIIRQPIRTEYKIAFPYLYNNLPHHVHLTWYHTPNVVFIKT-EDPDLPAFYFDPLINPISHR----HSVKSQEPLP 372
Cdd:COG5178   320 GIVRIIRRPPIDDEYPVAFPGLYNSRPRSVAVECYGSPECRDVFLdEDEDYPANFKDPLINPILGVqldnHPYDGKGSNE 399
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  373 D----DDEEFELPEFvEPFLKDTPLYTDNTANGIALLWAPRPFNLRSGRTRRALDIPLVKNWYREHCPAGQPVKVRVSYQ 448
Cdd:COG5178   400 EscvmERKLFSEPIF-YPYLYNESTEVRTTERAHLLLKNPFPFNKGKGRAERAQDVPLDKPWLLGHCLQERPVKVPVSYQ 478
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  449 KLLKYYVLNALKHRPPKAQKKRYLFRSFKATKFFQSTKLDWVEVGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVK 528
Cdd:COG5178   479 KLLKNYVRNMLHKTRPRPHTNTHLLKELKNTKYFQRTEIDWVEAGLQLCRQGHNMLSLLIHRKGLTYLHLDYNFNLKPTK 558
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  529 TLTTKERKKSRFGNAFHLCREVLRLTKLVVDSHVQYRLGNVDAFQLADGLQYIFAHVGQLTGMYRYKYKLMRQIRMCKDL 608
Cdd:COG5178   559 TLTTKERKKSRVGNSFHLMREMLKFIKLIVDIHVQFRLGNIDAYQLADGVHYILNHVGQLTGIYRYKYKLMKQIRACKDW 638
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  609 KHLIYYRFNTGpVGKGPGCGFWAAGWRVWLFFMRGITPLLERWLGNLLARQFEGRhSKGVAKTVTKQRVESHFDLELRAA 688
Cdd:COG5178   639 KHLIYYAFNEG-VGKGPGCGFWGPQWRVWLFFLRGHIPLLERYIGNLVTRQFEGR-SDYNPKPLTKQRSDSGYDLELRRQ 716
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  689 VMHDILDMMPEGIKQNKARTILQHLSEAWRCWKANIPWKVPGLPTPIENMILRYVKAKADWWTNTAHYNRERIRRGATVD 768
Cdd:COG5178   717 VMADILSMLPEGIRQTKVRTILQHLSEAWRCWKANIPWHVPGEPAPILEVIRRYIKSKADLWTSSAHFNRERISRGAGVG 796
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  769 KTVCKKNLGRLTRLYLKAEQERQHNYLKDGPYITAEEAVAVYTTTVHWLESRRFSPIPFPPLSYKHDTKLLILALERLKE 848
Cdd:COG5178   797 KTKEKKNLGRLTRLWVKLEQERQVDSAKVGPKSTKEEAKRIGKITVLWLESRMFEPIPFPPLRYKEDTKILVLALEYLKS 876
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  849 AYSVKSRLNQSQREELGLIEQAYDNPHEALSRIKRHLLTQRAFKEVGIEFMDLYSHLVPVYDVEPLEKITDAYLDQYLWY 928
Cdd:COG5178   877 KYTGKIRLNESTREELALLEKAYDNPHDTLFRIKKSLLTQRSFKEVGITLMRHYDGAIPVYSVDPVEKIVDAYLDQYLWY 956
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  929 EADKRRLFPPWIKPADTEPPPLLVYKWCQGINNLQDVWETSEGECNVMLESRFEKMYEKIDLTLLNRLLRLIVDHNIADY 1008
Cdd:COG5178   957 EADRRNLFPEWIKPSDSEMPPLLVYKWCQGINNLKAAWDTSNGERLVLYETKLEGIMEKVDNTLLNRLLKLVLDPNLADY 1036
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1009 MTAKNNVVINYKDMNHTNSYGIIRGLQFASFIVQYYGLVMDLLVLGLHRASEMAGPPQMPNDFLSFQDIATEAAHPIRLF 1088
Cdd:COG5178  1037 IIAKNNVVVVYKDMSHTNHYGLIRGLQFSSFIYQFYGLVVDLLVLGLQRATEIAGPADAPNVFMDFKSRATETSHPIRLY 1116
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1089 CRYIDRIHIFFRFTADEARDLIQRYLTEHPDPNNENIVGYNN--KKCWPRDARMRLMKHDVNLGRAVFWDIKNRLPRSVT 1166
Cdd:COG5178  1117 TRYMDDIYIVFRFQRKEEDSLLEDYLRENPDPEEANNERYRNyfKGCWPDDCRMRLGPLDVNLGRAVFWEILRRCPHSLT 1196
                        1210      1220      1230      1240      1250      1260      1270      1280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1167 TVQWENSFVSVYSKDNPNLLFNMCGFECRILPKCRTSyEEFTHKDGVWNLQNEVTKERTAQCFLRVDDESMQRFHNRVRQ 1246
Cdd:COG5178  1197 ATRWEPSFGSVYSKINPNLLFSMVGFEVRILPKIRKI-EERSLSSGVWRLGDGRTKQRTAHANLAVSEGGIEMFESRIRH 1275
                        1290      1300      1310      1320      1330      1340      1350      1360
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1247 ILMASGSTTFTKIVNKWNTALIGLMTYFREAVVNTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVVFYTPKELGGLG 1326
Cdd:COG5178  1276 ILMTSGSTTFTKVATKWNTQLIALVTYYREAICDTKGLLDKLVKAERLIQNRVKKGLNSKMPVRFPPAVFYAPKELGGLG 1355
                        1370      1380      1390      1400      1410      1420      1430      1440
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1327 MLSMGHVLIPQSDLRWSKQTDVGITHFRSGMSHEEDQLIPNLYRYIQPWESEFIDSQRVWAEYALKRQEAIAQNRRLTLE 1406
Cdd:COG5178  1356 MLSVGHILIPHSDLEWSKQTDTGITHFRSGMTTNGERLIPAAMRYISRWEYEFEDSQRVWAEYARKRQEAGQQNRRLTLE 1435
                        1450      1460      1470      1480      1490      1500      1510      1520
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1407 DLEDSWDRGIPRINTLFQKDRHTLAYDKGWRVRTDFKQYQVLKQNPFWWTHQRHDGKLWNLNNYRTDMIQALGGVEGILE 1486
Cdd:COG5178  1436 DLEMSWDRGIPRISTLFQRDRHTLAYDRGFRMRSEFKQYSLKPNNPFWWTDAKHDGKLWSLNRYRLDVIQALGGVEGILE 1515
                        1530      1540      1550      1560      1570      1580      1590      1600
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1487 HTLFKGTYFPTWEGLFWEKASGFEESMKWKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKI 1566
Cdd:COG5178  1516 HTLFKATGFRSWEGLFWEKASGFEESMKFKKLTNAQRMGLSQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGILMHGKI 1595
                        1610      1620      1630      1640      1650      1660      1670      1680
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1567 PTLKISLIQIFRAHLWQKIHESIVMDLCQVFDQELDALEIETVQKETIHPRKSYKMNSSCADILLFASYKWNVSRPSLLA 1646
Cdd:COG5178  1596 PTLKISLIQIFRNHLWQKIHESVVGDLCQVLDKELDVLQIETVQKETVHPRKSYKMNSSCADILLSGAYDWCVSSPSLLL 1675
                        1690      1700      1710      1720      1730      1740      1750      1760
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1647 DSKDVMDSTTTQKYWIDIQLRWGDYDSHDIERYARAKFLDYTTDNMSIYPSPTGVLIAIDLAYNLHSAYGNWFPGSKPLI 1726
Cdd:COG5178  1676 EERDGGSNVRTNKLWIDVQLRWGDYDSHDIHRYARAKFLDYTTDPQSMYPSPTGVVIGIDLCYNMWSAYGNWNEGLKPLI 1755
                        1770      1780      1790      1800      1810      1820      1830      1840
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1727 QQAMAKIMKANPALYVLRERIRKGLQLYSSEPTEPYLSSQNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGA 1806
Cdd:COG5178  1756 QSSMERIMKANPALYVLRERIRKGLQLYTSEPQEQYLSSSNYAELFSNSIDLFVDDTNVYRVTLHKTFEGNLTTKPINGA 1835
                        1850      1860      1870      1880      1890      1900      1910      1920
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1807 IFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEVAALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKG 1886
Cdd:COG5178  1836 IFVLNPATGNLFLKVIHTSVWAGQKRLIQLAKWKTAEEVFALGRSLPVEEQPKQIIVTRKSMLDPLEVHILDFPNISIRT 1915
                        1930      1940      1950      1960      1970      1980      1990      2000
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1887 SELQLPFQACLKVEKFGDLILKATEPQMVLFNLYDDWLKTISSYTAFSRLILILRALHVNNDRAKVILKPDKTTVTEPHH 1966
Cdd:COG5178  1916 CELALPFSAVMGIDKIRDLILRATEPQMVLFNLYDDWLQETSSYTAFSRLLLVLRALDVNEERVKEILRPDKSIITKINH 1995
                        2010      2020      2030      2040      2050      2060      2070      2080
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1967 IWPTLTDEEWIKVEVQLKDLILADYGKKNNVNVASLTQSEIRDIILGMEISAPSQQRQQIAEIEKQTKE--QSQLTATQT 2044
Cdd:COG5178  1996 LWPGFSDSQWIKKEIQLRDLILDRYCSKHNINPSGLTQSEVRDIILGFRISAPSGARQETAETEKQNSEkaLSRPTNVST 2075
                        2090      2100      2110      2120      2130      2140      2150      2160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 2045 RTVNKHGDEIITSTTSNYETQTFSSKTEWRVRAISAANLHLRTNHIYVSSDDIKETG-YTYILPKNVLKKFICISDLRAQ 2123
Cdd:COG5178  2076 KTINGWGREYVVLDGMIYEGEKFSSKEEWRSEAIRTGPLELRTKNIYVTADENEESIqQMYRLPLNLLEKFMRISDPHVQ 2155
                        2170      2180      2190      2200      2210      2220      2230      2240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 2124 IAGYLYGVSPPDNPQVKEIRCIVMVPQWGTHQTVHLPSQLPQHE-YLKEMEPLGWIHTQPNESPQLSPQDVTTHAKIMAD 2202
Cdd:COG5178  2156 VAGLVYGKSGSDNPQIKEILSFGLVPQLGSLSGVQSSSFVPHDLpGDEDLEILGWIHTQDDELPYLEVAGVLTHRKKIVD 2235
                        2250      2260      2270      2280      2290      2300      2310      2320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 2203 nPSWDgekTIIITCSFTPGSCTLTAYKLTPSGYEWGRQNTDKGNN-PKGYLPSHYERVQMLLSDRFLGFFMVPAQSSWNY 2281
Cdd:COG5178  2236 -PEWD---AVTLTVSYLPGSISLRAYVVKKEGCNWGSKNMDINSDeAIGVEPVLGKDCQLLLSDRIQGVFYVPEEEVWNY 2311
                        2330      2340      2350      2360      2370
                  ....*....|....*....|....*....|....*....|....*....|....
gi 115583687 2282 NFMGVRHDPNMKYELQLANPKEFYHEVHRPSHFLNFALLQEGEVYSADREDLYA 2335
Cdd:COG5178  2312 NFAGPFFDDRLEYTWKIGMPLGFYDGFHRPGHFSRFYELRAGGRLEEWQEDAFA 2365
 
Name Accession Description Interval E-value
PRP8 COG5178
U5 snRNP spliceosome subunit [RNA processing and modification];
1-2335 0e+00

U5 snRNP spliceosome subunit [RNA processing and modification];


Pssm-ID: 227505 [Multi-domain]  Cd Length: 2365  Bit Score: 3576.09  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687    1 MAGVFP-YRGPGNPVPG-------PLAPLPDYMSEEK--------------LQEKARKWQQLQ-AKRYAEKRKFGFVDAQ 57
Cdd:COG5178     1 MASLPPgNPPPPPPPPGfeppsqpPPPPPPGVNVKKRsrkqlsivgdilghSGNPIYSLRVSDkPVKLGNKAKTLHVLTL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687   58 KEDMPPEHVRKIIRDHGDMTNRKFRHDKRVYLGALKYMPHAVLKLLENMPMPWEQIRDVPVLYHITGAISFVNEIPWVIE 137
Cdd:COG5178    81 KAPIPPEHLRKIQSPCSDMPSVLTKVDKRSYLGALKYLPHAVLKLLENMPSPWEDVSEVKVLYHCHGAITFVNEVPRVIE 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  138 PVYISQWGSMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNILDVEPLEAIQLELDPEEDAPVLDWFYDHQPLRDSR 217
Cdd:COG5178   161 PQLFAQWGLCWSPMRREKRDRYSFKRKRFPPFDDLEPPLSKSQWVLGVEPLMPINIRLDRMDDEHVRDWVYTSRDLEDHP 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  218 KyVNGSTYQRWQFTLPMMSTLYRLANQLLTDLVDDNYFYLFDLKAFFTSKALNMAIPGGPKFEPLVRDINLQDEDWNEFN 297
Cdd:COG5178   241 S-VNGAMYRRWKYMLPAMHNLLRLMPMLWESIRDVNYVYLFSGLSFFVAKALNVAIPGGPKFEPLYSRESAEFEDENEFN 319
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  298 DINKIIIRQPIRTEYKIAFPYLYNNLPHHVHLTWYHTPNVVFIKT-EDPDLPAFYFDPLINPISHR----HSVKSQEPLP 372
Cdd:COG5178   320 GIVRIIRRPPIDDEYPVAFPGLYNSRPRSVAVECYGSPECRDVFLdEDEDYPANFKDPLINPILGVqldnHPYDGKGSNE 399
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  373 D----DDEEFELPEFvEPFLKDTPLYTDNTANGIALLWAPRPFNLRSGRTRRALDIPLVKNWYREHCPAGQPVKVRVSYQ 448
Cdd:COG5178   400 EscvmERKLFSEPIF-YPYLYNESTEVRTTERAHLLLKNPFPFNKGKGRAERAQDVPLDKPWLLGHCLQERPVKVPVSYQ 478
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  449 KLLKYYVLNALKHRPPKAQKKRYLFRSFKATKFFQSTKLDWVEVGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVK 528
Cdd:COG5178   479 KLLKNYVRNMLHKTRPRPHTNTHLLKELKNTKYFQRTEIDWVEAGLQLCRQGHNMLSLLIHRKGLTYLHLDYNFNLKPTK 558
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  529 TLTTKERKKSRFGNAFHLCREVLRLTKLVVDSHVQYRLGNVDAFQLADGLQYIFAHVGQLTGMYRYKYKLMRQIRMCKDL 608
Cdd:COG5178   559 TLTTKERKKSRVGNSFHLMREMLKFIKLIVDIHVQFRLGNIDAYQLADGVHYILNHVGQLTGIYRYKYKLMKQIRACKDW 638
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  609 KHLIYYRFNTGpVGKGPGCGFWAAGWRVWLFFMRGITPLLERWLGNLLARQFEGRhSKGVAKTVTKQRVESHFDLELRAA 688
Cdd:COG5178   639 KHLIYYAFNEG-VGKGPGCGFWGPQWRVWLFFLRGHIPLLERYIGNLVTRQFEGR-SDYNPKPLTKQRSDSGYDLELRRQ 716
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  689 VMHDILDMMPEGIKQNKARTILQHLSEAWRCWKANIPWKVPGLPTPIENMILRYVKAKADWWTNTAHYNRERIRRGATVD 768
Cdd:COG5178   717 VMADILSMLPEGIRQTKVRTILQHLSEAWRCWKANIPWHVPGEPAPILEVIRRYIKSKADLWTSSAHFNRERISRGAGVG 796
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  769 KTVCKKNLGRLTRLYLKAEQERQHNYLKDGPYITAEEAVAVYTTTVHWLESRRFSPIPFPPLSYKHDTKLLILALERLKE 848
Cdd:COG5178   797 KTKEKKNLGRLTRLWVKLEQERQVDSAKVGPKSTKEEAKRIGKITVLWLESRMFEPIPFPPLRYKEDTKILVLALEYLKS 876
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  849 AYSVKSRLNQSQREELGLIEQAYDNPHEALSRIKRHLLTQRAFKEVGIEFMDLYSHLVPVYDVEPLEKITDAYLDQYLWY 928
Cdd:COG5178   877 KYTGKIRLNESTREELALLEKAYDNPHDTLFRIKKSLLTQRSFKEVGITLMRHYDGAIPVYSVDPVEKIVDAYLDQYLWY 956
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  929 EADKRRLFPPWIKPADTEPPPLLVYKWCQGINNLQDVWETSEGECNVMLESRFEKMYEKIDLTLLNRLLRLIVDHNIADY 1008
Cdd:COG5178   957 EADRRNLFPEWIKPSDSEMPPLLVYKWCQGINNLKAAWDTSNGERLVLYETKLEGIMEKVDNTLLNRLLKLVLDPNLADY 1036
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1009 MTAKNNVVINYKDMNHTNSYGIIRGLQFASFIVQYYGLVMDLLVLGLHRASEMAGPPQMPNDFLSFQDIATEAAHPIRLF 1088
Cdd:COG5178  1037 IIAKNNVVVVYKDMSHTNHYGLIRGLQFSSFIYQFYGLVVDLLVLGLQRATEIAGPADAPNVFMDFKSRATETSHPIRLY 1116
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1089 CRYIDRIHIFFRFTADEARDLIQRYLTEHPDPNNENIVGYNN--KKCWPRDARMRLMKHDVNLGRAVFWDIKNRLPRSVT 1166
Cdd:COG5178  1117 TRYMDDIYIVFRFQRKEEDSLLEDYLRENPDPEEANNERYRNyfKGCWPDDCRMRLGPLDVNLGRAVFWEILRRCPHSLT 1196
                        1210      1220      1230      1240      1250      1260      1270      1280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1167 TVQWENSFVSVYSKDNPNLLFNMCGFECRILPKCRTSyEEFTHKDGVWNLQNEVTKERTAQCFLRVDDESMQRFHNRVRQ 1246
Cdd:COG5178  1197 ATRWEPSFGSVYSKINPNLLFSMVGFEVRILPKIRKI-EERSLSSGVWRLGDGRTKQRTAHANLAVSEGGIEMFESRIRH 1275
                        1290      1300      1310      1320      1330      1340      1350      1360
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1247 ILMASGSTTFTKIVNKWNTALIGLMTYFREAVVNTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVVFYTPKELGGLG 1326
Cdd:COG5178  1276 ILMTSGSTTFTKVATKWNTQLIALVTYYREAICDTKGLLDKLVKAERLIQNRVKKGLNSKMPVRFPPAVFYAPKELGGLG 1355
                        1370      1380      1390      1400      1410      1420      1430      1440
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1327 MLSMGHVLIPQSDLRWSKQTDVGITHFRSGMSHEEDQLIPNLYRYIQPWESEFIDSQRVWAEYALKRQEAIAQNRRLTLE 1406
Cdd:COG5178  1356 MLSVGHILIPHSDLEWSKQTDTGITHFRSGMTTNGERLIPAAMRYISRWEYEFEDSQRVWAEYARKRQEAGQQNRRLTLE 1435
                        1450      1460      1470      1480      1490      1500      1510      1520
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1407 DLEDSWDRGIPRINTLFQKDRHTLAYDKGWRVRTDFKQYQVLKQNPFWWTHQRHDGKLWNLNNYRTDMIQALGGVEGILE 1486
Cdd:COG5178  1436 DLEMSWDRGIPRISTLFQRDRHTLAYDRGFRMRSEFKQYSLKPNNPFWWTDAKHDGKLWSLNRYRLDVIQALGGVEGILE 1515
                        1530      1540      1550      1560      1570      1580      1590      1600
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1487 HTLFKGTYFPTWEGLFWEKASGFEESMKWKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKI 1566
Cdd:COG5178  1516 HTLFKATGFRSWEGLFWEKASGFEESMKFKKLTNAQRMGLSQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGILMHGKI 1595
                        1610      1620      1630      1640      1650      1660      1670      1680
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1567 PTLKISLIQIFRAHLWQKIHESIVMDLCQVFDQELDALEIETVQKETIHPRKSYKMNSSCADILLFASYKWNVSRPSLLA 1646
Cdd:COG5178  1596 PTLKISLIQIFRNHLWQKIHESVVGDLCQVLDKELDVLQIETVQKETVHPRKSYKMNSSCADILLSGAYDWCVSSPSLLL 1675
                        1690      1700      1710      1720      1730      1740      1750      1760
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1647 DSKDVMDSTTTQKYWIDIQLRWGDYDSHDIERYARAKFLDYTTDNMSIYPSPTGVLIAIDLAYNLHSAYGNWFPGSKPLI 1726
Cdd:COG5178  1676 EERDGGSNVRTNKLWIDVQLRWGDYDSHDIHRYARAKFLDYTTDPQSMYPSPTGVVIGIDLCYNMWSAYGNWNEGLKPLI 1755
                        1770      1780      1790      1800      1810      1820      1830      1840
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1727 QQAMAKIMKANPALYVLRERIRKGLQLYSSEPTEPYLSSQNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGA 1806
Cdd:COG5178  1756 QSSMERIMKANPALYVLRERIRKGLQLYTSEPQEQYLSSSNYAELFSNSIDLFVDDTNVYRVTLHKTFEGNLTTKPINGA 1835
                        1850      1860      1870      1880      1890      1900      1910      1920
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1807 IFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEVAALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKG 1886
Cdd:COG5178  1836 IFVLNPATGNLFLKVIHTSVWAGQKRLIQLAKWKTAEEVFALGRSLPVEEQPKQIIVTRKSMLDPLEVHILDFPNISIRT 1915
                        1930      1940      1950      1960      1970      1980      1990      2000
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1887 SELQLPFQACLKVEKFGDLILKATEPQMVLFNLYDDWLKTISSYTAFSRLILILRALHVNNDRAKVILKPDKTTVTEPHH 1966
Cdd:COG5178  1916 CELALPFSAVMGIDKIRDLILRATEPQMVLFNLYDDWLQETSSYTAFSRLLLVLRALDVNEERVKEILRPDKSIITKINH 1995
                        2010      2020      2030      2040      2050      2060      2070      2080
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1967 IWPTLTDEEWIKVEVQLKDLILADYGKKNNVNVASLTQSEIRDIILGMEISAPSQQRQQIAEIEKQTKE--QSQLTATQT 2044
Cdd:COG5178  1996 LWPGFSDSQWIKKEIQLRDLILDRYCSKHNINPSGLTQSEVRDIILGFRISAPSGARQETAETEKQNSEkaLSRPTNVST 2075
                        2090      2100      2110      2120      2130      2140      2150      2160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 2045 RTVNKHGDEIITSTTSNYETQTFSSKTEWRVRAISAANLHLRTNHIYVSSDDIKETG-YTYILPKNVLKKFICISDLRAQ 2123
Cdd:COG5178  2076 KTINGWGREYVVLDGMIYEGEKFSSKEEWRSEAIRTGPLELRTKNIYVTADENEESIqQMYRLPLNLLEKFMRISDPHVQ 2155
                        2170      2180      2190      2200      2210      2220      2230      2240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 2124 IAGYLYGVSPPDNPQVKEIRCIVMVPQWGTHQTVHLPSQLPQHE-YLKEMEPLGWIHTQPNESPQLSPQDVTTHAKIMAD 2202
Cdd:COG5178  2156 VAGLVYGKSGSDNPQIKEILSFGLVPQLGSLSGVQSSSFVPHDLpGDEDLEILGWIHTQDDELPYLEVAGVLTHRKKIVD 2235
                        2250      2260      2270      2280      2290      2300      2310      2320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 2203 nPSWDgekTIIITCSFTPGSCTLTAYKLTPSGYEWGRQNTDKGNN-PKGYLPSHYERVQMLLSDRFLGFFMVPAQSSWNY 2281
Cdd:COG5178  2236 -PEWD---AVTLTVSYLPGSISLRAYVVKKEGCNWGSKNMDINSDeAIGVEPVLGKDCQLLLSDRIQGVFYVPEEEVWNY 2311
                        2330      2340      2350      2360      2370
                  ....*....|....*....|....*....|....*....|....*....|....
gi 115583687 2282 NFMGVRHDPNMKYELQLANPKEFYHEVHRPSHFLNFALLQEGEVYSADREDLYA 2335
Cdd:COG5178  2312 NFAGPFFDDRLEYTWKIGMPLGFYDGFHRPGHFSRFYELRAGGRLEEWQEDAFA 2365
PROCN pfam08083
PROCN (NUC071) domain; The PROCN domain is the central domain in pre-mRNA splicing factors of ...
395-796 0e+00

PROCN (NUC071) domain; The PROCN domain is the central domain in pre-mRNA splicing factors of PRO8 family.


Pssm-ID: 400431  Cd Length: 402  Bit Score: 873.25  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687   395 TDNTANGIALLWAPRPFNLRSGRTRRALDIPLVKNWYREHCPAGQPVKVRVSYQKLLKYYVLNALKHRPPKAQKKRYLFR 474
Cdd:pfam08083    1 NENTKDGISLYWAPYPFNRRSGRTKRAQDVPLIKHWYREHPPSNYPVKVRVSYQKLLKNYVLNELHHRKQKKHKKKNLLK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687   475 SFKATKFFQSTKLDWVEVGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRFGNAFHLCREVLRLT 554
Cdd:pfam08083   81 SLKNTKFFQQTTIDWVEAGLQVCRQGHNMLNLLIHRKGLTYLHLDYNFNLKPTKTLTTKERKKSRFGNAFHLIRELLRLT 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687   555 KLVVDSHVQYRLGNVDAFQLADGLQYIFAHVGQLTGMYRYKYKLMRQIRMCKDLKHLIYYRFNTGPVGKGPGCGFWAAGW 634
Cdd:pfam08083  161 KLLVDAHVQYRLGNIDAYQLADGLQYIFNHVGQLTGMYRYKYKVMRQIRACKDLKHVIYSRFNTGEVGKGPGCGFWAPSW 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687   635 RVWLFFMRGITPLLERWLGNLLARQFEGRHSKGVAKTVTKQRVESHFDLELRAAVMHDILDMMPEGIKQNKARTILQHLS 714
Cdd:pfam08083  241 RVWIFFLRGIIPLLERWLGNLLARQFEGRKSKDVAKTITKQRVESYYDLELRASVMHDILDMMPEGVKQNKARTILQHLS 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687   715 EAWRCWKANIPWKVPGLPTPIENMILRYVKAKADWWTNTAHYNRERIRRGATVDKTVCKKNLGRLTRLYLKAEQERQHNY 794
Cdd:pfam08083  321 EAWRCWKANIPWKVPGLPEPIENIILRYVKAKADWWTSVAHYNRERIKRGATVDKTVAKKNLGRLTRLWLKAEQERQANY 400

                   ..
gi 115583687   795 LK 796
Cdd:pfam08083  401 LK 402
RNase_H_like_Prp8_IV cd13838
Ribonuclease-like Prp8 domain IV core; This family contains Prp8 domain IV, which adopts a ...
1766-2016 0e+00

Ribonuclease-like Prp8 domain IV core; This family contains Prp8 domain IV, which adopts a RNase H like fold within its core structure but with little sequence similarity. Prp8, a spliceosome protein, interacts directly with the splice sites and branch regions of precursor-mRNAs and spliceosomal RNAs associated with catalysis of the two steps of splicing. Catalysis of RNA cleavage by RNase H-like proteins involves a two-metal mechanism in which adjacently-bound divalent magnesium ions promote hydrolysis by activation of a water nucleophile and stabilization of the transition-state. However, the Prp8 domain IV contains only one of the canonical metal-binding sites and the coordinating side chains are spatially conserved with respect to Mg2+-coordinating residues within the RNase H fold.


Pssm-ID: 260013  Cd Length: 251  Bit Score: 575.06  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1766 QNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEV 1845
Cdd:cd13838     1 QNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1846 AALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKVEKFGDLILKATEPQMVLFNLYDDWLK 1925
Cdd:cd13838    81 AALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKIEKFGDLILKATEPQMVLFNLYDDWLK 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1926 TISSYTAFSRLILILRALHVNNDRAKVILKPDKTTVTEPHHIWPTLTDEEWIKVEVQLKDLILADYGKKNNVNVASLTQS 2005
Cdd:cd13838   161 TISSYTAFSRLILILRALHVNNEKAKIILKPDKTVITEPHHIWPTLSDEEWIKVEVQLKDLILADYGKKNNVNVASLTQS 240
                         250
                  ....*....|.
gi 115583687 2006 EIRDIILGMEI 2016
Cdd:cd13838   241 EIRDIILGMEI 251
JAB_MPN smart00232
JAB/MPN domain; Domain in Jun kinase activation domain binding protein and proteasomal ...
2102-2233 1.81e-28

JAB/MPN domain; Domain in Jun kinase activation domain binding protein and proteasomal subunits. Domain at Mpr1p and Pad1p N-termini. Domain of unknown function.


Pssm-ID: 214573  Cd Length: 135  Bit Score: 112.47  E-value: 1.81e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687   2102 YTYILPKNVLKKFIciSDLRAQIAGYLYGVSPPDNPQVKEIRCIVMVPQWGTHQTV-HLPSQLPQHEYLK---EMEPLGW 2177
Cdd:smart00232    4 VHPLVPLNILKHAI--RDGPEEVCGVLLGKSNKDRPEVKEVFAVPNEPQDDSVQEYdEDYSHLMDEELKKvnkDLEIVGW 81
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*.
gi 115583687   2178 IHTQPNESPQLSPQDVTTHAKIMADNPSWDGeKTIIITCSFTpGSCTLTAYKLTPS 2233
Cdd:smart00232   82 YHSHPDESPFPSEVDVATHESYQAPWPISVV-LIVDPIKSFQ-GRLSLRAFRLTPE 135
 
Name Accession Description Interval E-value
PRP8 COG5178
U5 snRNP spliceosome subunit [RNA processing and modification];
1-2335 0e+00

U5 snRNP spliceosome subunit [RNA processing and modification];


Pssm-ID: 227505 [Multi-domain]  Cd Length: 2365  Bit Score: 3576.09  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687    1 MAGVFP-YRGPGNPVPG-------PLAPLPDYMSEEK--------------LQEKARKWQQLQ-AKRYAEKRKFGFVDAQ 57
Cdd:COG5178     1 MASLPPgNPPPPPPPPGfeppsqpPPPPPPGVNVKKRsrkqlsivgdilghSGNPIYSLRVSDkPVKLGNKAKTLHVLTL 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687   58 KEDMPPEHVRKIIRDHGDMTNRKFRHDKRVYLGALKYMPHAVLKLLENMPMPWEQIRDVPVLYHITGAISFVNEIPWVIE 137
Cdd:COG5178    81 KAPIPPEHLRKIQSPCSDMPSVLTKVDKRSYLGALKYLPHAVLKLLENMPSPWEDVSEVKVLYHCHGAITFVNEVPRVIE 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  138 PVYISQWGSMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNILDVEPLEAIQLELDPEEDAPVLDWFYDHQPLRDSR 217
Cdd:COG5178   161 PQLFAQWGLCWSPMRREKRDRYSFKRKRFPPFDDLEPPLSKSQWVLGVEPLMPINIRLDRMDDEHVRDWVYTSRDLEDHP 240
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  218 KyVNGSTYQRWQFTLPMMSTLYRLANQLLTDLVDDNYFYLFDLKAFFTSKALNMAIPGGPKFEPLVRDINLQDEDWNEFN 297
Cdd:COG5178   241 S-VNGAMYRRWKYMLPAMHNLLRLMPMLWESIRDVNYVYLFSGLSFFVAKALNVAIPGGPKFEPLYSRESAEFEDENEFN 319
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  298 DINKIIIRQPIRTEYKIAFPYLYNNLPHHVHLTWYHTPNVVFIKT-EDPDLPAFYFDPLINPISHR----HSVKSQEPLP 372
Cdd:COG5178   320 GIVRIIRRPPIDDEYPVAFPGLYNSRPRSVAVECYGSPECRDVFLdEDEDYPANFKDPLINPILGVqldnHPYDGKGSNE 399
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  373 D----DDEEFELPEFvEPFLKDTPLYTDNTANGIALLWAPRPFNLRSGRTRRALDIPLVKNWYREHCPAGQPVKVRVSYQ 448
Cdd:COG5178   400 EscvmERKLFSEPIF-YPYLYNESTEVRTTERAHLLLKNPFPFNKGKGRAERAQDVPLDKPWLLGHCLQERPVKVPVSYQ 478
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  449 KLLKYYVLNALKHRPPKAQKKRYLFRSFKATKFFQSTKLDWVEVGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVK 528
Cdd:COG5178   479 KLLKNYVRNMLHKTRPRPHTNTHLLKELKNTKYFQRTEIDWVEAGLQLCRQGHNMLSLLIHRKGLTYLHLDYNFNLKPTK 558
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  529 TLTTKERKKSRFGNAFHLCREVLRLTKLVVDSHVQYRLGNVDAFQLADGLQYIFAHVGQLTGMYRYKYKLMRQIRMCKDL 608
Cdd:COG5178   559 TLTTKERKKSRVGNSFHLMREMLKFIKLIVDIHVQFRLGNIDAYQLADGVHYILNHVGQLTGIYRYKYKLMKQIRACKDW 638
                         650       660       670       680       690       700       710       720
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  609 KHLIYYRFNTGpVGKGPGCGFWAAGWRVWLFFMRGITPLLERWLGNLLARQFEGRhSKGVAKTVTKQRVESHFDLELRAA 688
Cdd:COG5178   639 KHLIYYAFNEG-VGKGPGCGFWGPQWRVWLFFLRGHIPLLERYIGNLVTRQFEGR-SDYNPKPLTKQRSDSGYDLELRRQ 716
                         730       740       750       760       770       780       790       800
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  689 VMHDILDMMPEGIKQNKARTILQHLSEAWRCWKANIPWKVPGLPTPIENMILRYVKAKADWWTNTAHYNRERIRRGATVD 768
Cdd:COG5178   717 VMADILSMLPEGIRQTKVRTILQHLSEAWRCWKANIPWHVPGEPAPILEVIRRYIKSKADLWTSSAHFNRERISRGAGVG 796
                         810       820       830       840       850       860       870       880
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  769 KTVCKKNLGRLTRLYLKAEQERQHNYLKDGPYITAEEAVAVYTTTVHWLESRRFSPIPFPPLSYKHDTKLLILALERLKE 848
Cdd:COG5178   797 KTKEKKNLGRLTRLWVKLEQERQVDSAKVGPKSTKEEAKRIGKITVLWLESRMFEPIPFPPLRYKEDTKILVLALEYLKS 876
                         890       900       910       920       930       940       950       960
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  849 AYSVKSRLNQSQREELGLIEQAYDNPHEALSRIKRHLLTQRAFKEVGIEFMDLYSHLVPVYDVEPLEKITDAYLDQYLWY 928
Cdd:COG5178   877 KYTGKIRLNESTREELALLEKAYDNPHDTLFRIKKSLLTQRSFKEVGITLMRHYDGAIPVYSVDPVEKIVDAYLDQYLWY 956
                         970       980       990      1000      1010      1020      1030      1040
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  929 EADKRRLFPPWIKPADTEPPPLLVYKWCQGINNLQDVWETSEGECNVMLESRFEKMYEKIDLTLLNRLLRLIVDHNIADY 1008
Cdd:COG5178   957 EADRRNLFPEWIKPSDSEMPPLLVYKWCQGINNLKAAWDTSNGERLVLYETKLEGIMEKVDNTLLNRLLKLVLDPNLADY 1036
                        1050      1060      1070      1080      1090      1100      1110      1120
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1009 MTAKNNVVINYKDMNHTNSYGIIRGLQFASFIVQYYGLVMDLLVLGLHRASEMAGPPQMPNDFLSFQDIATEAAHPIRLF 1088
Cdd:COG5178  1037 IIAKNNVVVVYKDMSHTNHYGLIRGLQFSSFIYQFYGLVVDLLVLGLQRATEIAGPADAPNVFMDFKSRATETSHPIRLY 1116
                        1130      1140      1150      1160      1170      1180      1190      1200
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1089 CRYIDRIHIFFRFTADEARDLIQRYLTEHPDPNNENIVGYNN--KKCWPRDARMRLMKHDVNLGRAVFWDIKNRLPRSVT 1166
Cdd:COG5178  1117 TRYMDDIYIVFRFQRKEEDSLLEDYLRENPDPEEANNERYRNyfKGCWPDDCRMRLGPLDVNLGRAVFWEILRRCPHSLT 1196
                        1210      1220      1230      1240      1250      1260      1270      1280
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1167 TVQWENSFVSVYSKDNPNLLFNMCGFECRILPKCRTSyEEFTHKDGVWNLQNEVTKERTAQCFLRVDDESMQRFHNRVRQ 1246
Cdd:COG5178  1197 ATRWEPSFGSVYSKINPNLLFSMVGFEVRILPKIRKI-EERSLSSGVWRLGDGRTKQRTAHANLAVSEGGIEMFESRIRH 1275
                        1290      1300      1310      1320      1330      1340      1350      1360
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1247 ILMASGSTTFTKIVNKWNTALIGLMTYFREAVVNTQELLDLLVKCENKIQTRIKIGLNSKMPSRFPPVVFYTPKELGGLG 1326
Cdd:COG5178  1276 ILMTSGSTTFTKVATKWNTQLIALVTYYREAICDTKGLLDKLVKAERLIQNRVKKGLNSKMPVRFPPAVFYAPKELGGLG 1355
                        1370      1380      1390      1400      1410      1420      1430      1440
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1327 MLSMGHVLIPQSDLRWSKQTDVGITHFRSGMSHEEDQLIPNLYRYIQPWESEFIDSQRVWAEYALKRQEAIAQNRRLTLE 1406
Cdd:COG5178  1356 MLSVGHILIPHSDLEWSKQTDTGITHFRSGMTTNGERLIPAAMRYISRWEYEFEDSQRVWAEYARKRQEAGQQNRRLTLE 1435
                        1450      1460      1470      1480      1490      1500      1510      1520
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1407 DLEDSWDRGIPRINTLFQKDRHTLAYDKGWRVRTDFKQYQVLKQNPFWWTHQRHDGKLWNLNNYRTDMIQALGGVEGILE 1486
Cdd:COG5178  1436 DLEMSWDRGIPRISTLFQRDRHTLAYDRGFRMRSEFKQYSLKPNNPFWWTDAKHDGKLWSLNRYRLDVIQALGGVEGILE 1515
                        1530      1540      1550      1560      1570      1580      1590      1600
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1487 HTLFKGTYFPTWEGLFWEKASGFEESMKWKKLTNAQRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKI 1566
Cdd:COG5178  1516 HTLFKATGFRSWEGLFWEKASGFEESMKFKKLTNAQRMGLSQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGILMHGKI 1595
                        1610      1620      1630      1640      1650      1660      1670      1680
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1567 PTLKISLIQIFRAHLWQKIHESIVMDLCQVFDQELDALEIETVQKETIHPRKSYKMNSSCADILLFASYKWNVSRPSLLA 1646
Cdd:COG5178  1596 PTLKISLIQIFRNHLWQKIHESVVGDLCQVLDKELDVLQIETVQKETVHPRKSYKMNSSCADILLSGAYDWCVSSPSLLL 1675
                        1690      1700      1710      1720      1730      1740      1750      1760
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1647 DSKDVMDSTTTQKYWIDIQLRWGDYDSHDIERYARAKFLDYTTDNMSIYPSPTGVLIAIDLAYNLHSAYGNWFPGSKPLI 1726
Cdd:COG5178  1676 EERDGGSNVRTNKLWIDVQLRWGDYDSHDIHRYARAKFLDYTTDPQSMYPSPTGVVIGIDLCYNMWSAYGNWNEGLKPLI 1755
                        1770      1780      1790      1800      1810      1820      1830      1840
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1727 QQAMAKIMKANPALYVLRERIRKGLQLYSSEPTEPYLSSQNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGA 1806
Cdd:COG5178  1756 QSSMERIMKANPALYVLRERIRKGLQLYTSEPQEQYLSSSNYAELFSNSIDLFVDDTNVYRVTLHKTFEGNLTTKPINGA 1835
                        1850      1860      1870      1880      1890      1900      1910      1920
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1807 IFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEVAALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKG 1886
Cdd:COG5178  1836 IFVLNPATGNLFLKVIHTSVWAGQKRLIQLAKWKTAEEVFALGRSLPVEEQPKQIIVTRKSMLDPLEVHILDFPNISIRT 1915
                        1930      1940      1950      1960      1970      1980      1990      2000
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1887 SELQLPFQACLKVEKFGDLILKATEPQMVLFNLYDDWLKTISSYTAFSRLILILRALHVNNDRAKVILKPDKTTVTEPHH 1966
Cdd:COG5178  1916 CELALPFSAVMGIDKIRDLILRATEPQMVLFNLYDDWLQETSSYTAFSRLLLVLRALDVNEERVKEILRPDKSIITKINH 1995
                        2010      2020      2030      2040      2050      2060      2070      2080
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1967 IWPTLTDEEWIKVEVQLKDLILADYGKKNNVNVASLTQSEIRDIILGMEISAPSQQRQQIAEIEKQTKE--QSQLTATQT 2044
Cdd:COG5178  1996 LWPGFSDSQWIKKEIQLRDLILDRYCSKHNINPSGLTQSEVRDIILGFRISAPSGARQETAETEKQNSEkaLSRPTNVST 2075
                        2090      2100      2110      2120      2130      2140      2150      2160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 2045 RTVNKHGDEIITSTTSNYETQTFSSKTEWRVRAISAANLHLRTNHIYVSSDDIKETG-YTYILPKNVLKKFICISDLRAQ 2123
Cdd:COG5178  2076 KTINGWGREYVVLDGMIYEGEKFSSKEEWRSEAIRTGPLELRTKNIYVTADENEESIqQMYRLPLNLLEKFMRISDPHVQ 2155
                        2170      2180      2190      2200      2210      2220      2230      2240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 2124 IAGYLYGVSPPDNPQVKEIRCIVMVPQWGTHQTVHLPSQLPQHE-YLKEMEPLGWIHTQPNESPQLSPQDVTTHAKIMAD 2202
Cdd:COG5178  2156 VAGLVYGKSGSDNPQIKEILSFGLVPQLGSLSGVQSSSFVPHDLpGDEDLEILGWIHTQDDELPYLEVAGVLTHRKKIVD 2235
                        2250      2260      2270      2280      2290      2300      2310      2320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 2203 nPSWDgekTIIITCSFTPGSCTLTAYKLTPSGYEWGRQNTDKGNN-PKGYLPSHYERVQMLLSDRFLGFFMVPAQSSWNY 2281
Cdd:COG5178  2236 -PEWD---AVTLTVSYLPGSISLRAYVVKKEGCNWGSKNMDINSDeAIGVEPVLGKDCQLLLSDRIQGVFYVPEEEVWNY 2311
                        2330      2340      2350      2360      2370
                  ....*....|....*....|....*....|....*....|....*....|....
gi 115583687 2282 NFMGVRHDPNMKYELQLANPKEFYHEVHRPSHFLNFALLQEGEVYSADREDLYA 2335
Cdd:COG5178  2312 NFAGPFFDDRLEYTWKIGMPLGFYDGFHRPGHFSRFYELRAGGRLEEWQEDAFA 2365
PROCN pfam08083
PROCN (NUC071) domain; The PROCN domain is the central domain in pre-mRNA splicing factors of ...
395-796 0e+00

PROCN (NUC071) domain; The PROCN domain is the central domain in pre-mRNA splicing factors of PRO8 family.


Pssm-ID: 400431  Cd Length: 402  Bit Score: 873.25  E-value: 0e+00
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687   395 TDNTANGIALLWAPRPFNLRSGRTRRALDIPLVKNWYREHCPAGQPVKVRVSYQKLLKYYVLNALKHRPPKAQKKRYLFR 474
Cdd:pfam08083    1 NENTKDGISLYWAPYPFNRRSGRTKRAQDVPLIKHWYREHPPSNYPVKVRVSYQKLLKNYVLNELHHRKQKKHKKKNLLK 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687   475 SFKATKFFQSTKLDWVEVGLQVCRQGYNMLNLLIHRKNLNYLHLDYNFNLKPVKTLTTKERKKSRFGNAFHLCREVLRLT 554
Cdd:pfam08083   81 SLKNTKFFQQTTIDWVEAGLQVCRQGHNMLNLLIHRKGLTYLHLDYNFNLKPTKTLTTKERKKSRFGNAFHLIRELLRLT 160
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687   555 KLVVDSHVQYRLGNVDAFQLADGLQYIFAHVGQLTGMYRYKYKLMRQIRMCKDLKHLIYYRFNTGPVGKGPGCGFWAAGW 634
Cdd:pfam08083  161 KLLVDAHVQYRLGNIDAYQLADGLQYIFNHVGQLTGMYRYKYKVMRQIRACKDLKHVIYSRFNTGEVGKGPGCGFWAPSW 240
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687   635 RVWLFFMRGITPLLERWLGNLLARQFEGRHSKGVAKTVTKQRVESHFDLELRAAVMHDILDMMPEGIKQNKARTILQHLS 714
Cdd:pfam08083  241 RVWIFFLRGIIPLLERWLGNLLARQFEGRKSKDVAKTITKQRVESYYDLELRASVMHDILDMMPEGVKQNKARTILQHLS 320
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687   715 EAWRCWKANIPWKVPGLPTPIENMILRYVKAKADWWTNTAHYNRERIRRGATVDKTVCKKNLGRLTRLYLKAEQERQHNY 794
Cdd:pfam08083  321 EAWRCWKANIPWKVPGLPEPIENIILRYVKAKADWWTSVAHYNRERIKRGATVDKTVAKKNLGRLTRLWLKAEQERQANY 400

                   ..
gi 115583687   795 LK 796
Cdd:pfam08083  401 LK 402
RNase_H_like_Prp8_IV cd13838
Ribonuclease-like Prp8 domain IV core; This family contains Prp8 domain IV, which adopts a ...
1766-2016 0e+00

Ribonuclease-like Prp8 domain IV core; This family contains Prp8 domain IV, which adopts a RNase H like fold within its core structure but with little sequence similarity. Prp8, a spliceosome protein, interacts directly with the splice sites and branch regions of precursor-mRNAs and spliceosomal RNAs associated with catalysis of the two steps of splicing. Catalysis of RNA cleavage by RNase H-like proteins involves a two-metal mechanism in which adjacently-bound divalent magnesium ions promote hydrolysis by activation of a water nucleophile and stabilization of the transition-state. However, the Prp8 domain IV contains only one of the canonical metal-binding sites and the coordinating side chains are spatially conserved with respect to Mg2+-coordinating residues within the RNase H fold.


Pssm-ID: 260013  Cd Length: 251  Bit Score: 575.06  E-value: 0e+00
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1766 QNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEV 1845
Cdd:cd13838     1 QNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKWKTAEEV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1846 AALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKVEKFGDLILKATEPQMVLFNLYDDWLK 1925
Cdd:cd13838    81 AALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKIEKFGDLILKATEPQMVLFNLYDDWLK 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 1926 TISSYTAFSRLILILRALHVNNDRAKVILKPDKTTVTEPHHIWPTLTDEEWIKVEVQLKDLILADYGKKNNVNVASLTQS 2005
Cdd:cd13838   161 TISSYTAFSRLILILRALHVNNEKAKIILKPDKTVITEPHHIWPTLSDEEWIKVEVQLKDLILADYGKKNNVNVASLTQS 240
                         250
                  ....*....|.
gi 115583687 2006 EIRDIILGMEI 2016
Cdd:cd13838   241 EIRDIILGMEI 251
MPN_PRP8 cd08056
Mpr1p, Pad1p N-terminal (MPN) domains without isopeptidase activity found in splicing factor ...
2067-2317 1.33e-179

Mpr1p, Pad1p N-terminal (MPN) domains without isopeptidase activity found in splicing factor Prp8; Members of this family are found in pre-mRNA-processing factor 8 (Prp8) which is a critical splicing factor, interacting with several other spliceosomal proteins, snRNAs, and the pre-mRNA, thus organizing and stabilizing the spliceosome catalytic core. Prp8 is one of the largest and most highly conserved of nuclear proteins, occupying a central position in the catalytic core of the spliceosome. Its C-terminal domain exhibits a JAB1/MPN-like core similar to deubiquitinating enzymes, but does not show catalytic isopeptidase activity, possibly because the putative isopeptidase center is covered by insertions and terminal appendices that are grafted onto this core, thus impairing the metal binding site. It is proposed that this domain is a protein interaction domain instead of a Zn(2+)-dependent metalloenzyme as proposed for some MPN proteins. The DEAD-box protein Brr2 and the GTPase Snu114 bind to the Prp8 C-terminus, a region where mutations in human Prp8 (hPrp8) cause a severe form of the genetic disorder retinitis pigmentosa, RP13, which leads to progressive photoreceptor degeneration in the retina and eventual blindness. At the N-terminus of Prp8, there are several domains, including a highly variable nuclear localization signal (NLS) motif rich in prolines, a conserved RNA recognition motif (RRM), and U5 and U6 snRNA binding sites.


Pssm-ID: 163687 [Multi-domain]  Cd Length: 252  Bit Score: 547.61  E-value: 1.33e-179
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 2067 FSSKTEWRVRAISAANLHLRTNHIYVSSDDIKETGYTYILPKNVLKKFICISDLRAQIAGYLYGVSPPDNPQVKEIRCIV 2146
Cdd:cd08056     1 FSSKTDWRVRAIAATNLHLRTKNIYVSSDDIKETGYTYILPKNLLKKFISISDLRTQIAGYLYGKSPPDNPQVKEIRCIV 80
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 2147 MVPQWGTHQTVHLPSQLPQHEYLKEMEPLGWIHTQPNESPQLSPQDVTTHAKIMADNPSWDGEKTIIITCSFTPGSCTLT 2226
Cdd:cd08056    81 LVPQLGTHQTVTLPQQLPQHEYLEDLEPLGWIHTQPNELPQLSPQDVTTHAKILADNPSWDGEKTVILTCSFTPGSCSLT 160
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 2227 AYKLTPSGYEWGRQNTDKGNN-PKGYLPSHYERVQMLLSDRFLGFFMVPAQSSWNYNFMGVRHDPNMKYELQLANPKEFY 2305
Cdd:cd08056   161 AYKLTPEGYEWGKQNKDLGNNtPKGYSPSFYEKVQLLLSDRFLGFFLVPEDGVWNYNFMGAKHSPNMKYDLKLDIPKEFY 240
                         250
                  ....*....|..
gi 115583687 2306 HEVHRPSHFLNF 2317
Cdd:cd08056   241 HELHRPTHFLQF 252
PRP8_domainIV pfam12134
PRP8 domain IV core; This domain is found in eukaryotes, and is about 20 amino acids in length. ...
1760-1989 1.13e-164

PRP8 domain IV core; This domain is found in eukaryotes, and is about 20 amino acids in length. It is found associated with pfam10597, pfam10596, pfam10598, pfam08083, pfam08082, pfam01398, pfam08084. There is a conserved LILR sequence motif. The domain is a selenomethionine domain in a subunit of the spliceosome. The function of PRP8 domain IV is believed to be interaction with the splicosomal core.


Pssm-ID: 432353  Cd Length: 230  Bit Score: 505.36  E-value: 1.13e-164
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  1760 EPYLSSQNYGELFSNQIIWFVDDTNVYRVTIHKTFEGNLTTKPINGAIFIFNPRTGQLFLKIIHTSVWAGQKRLGQLAKW 1839
Cdd:pfam12134    1 EPFLNSQNYAELFSNQTQWFVDDTNVYRVTVHKTFEGNLTTKPVNGAVFILNPKTGQLFLKIIHTSVWAGQKRLSQLAKW 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  1840 KTAEEVAALIRSLPVEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIKGSELQLPFQACLKVEKFGDLILKATEPQMVLFNL 1919
Cdd:pfam12134   81 KTAEEVAALVRSLPPEEQPKQIIVTRKGMLDPLEVHLLDFPNIVIRGSELHLPFPAALKIEKLGDLVLKATEPQMVLFNI 160
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  1920 YDDWLKTISSYTAFSRLILILRALHVNNDRAKVILKPDKTTVTEPHHIWPTLTDEEWIKVEVQLKDLILA 1989
Cdd:pfam12134  161 YDDWLKSISPYTAFSRLILILRALHINPEKTKMILRPDPTVVTKPHHLWPTLSDQQWIEVEIQLRDLILS 230
PRO8NT pfam08082
PRO8NT (NUC069), PrP8 N-terminal domain; The PRO8NT domain is found at the N-terminus of ...
58-209 9.04e-116

PRO8NT (NUC069), PrP8 N-terminal domain; The PRO8NT domain is found at the N-terminus of pre-mRNA splicing factors of PRO8 family. The NLS or nuclear localization signal for these spliceosome proteins begins at the start and runs for 60 residues. N-terminal to this domain is a highly variable proline-rich region.


Pssm-ID: 462361 [Multi-domain]  Cd Length: 152  Bit Score: 362.77  E-value: 9.04e-116
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687    58 KEDMPPEHVRKIIRDHGDMTNRKFRHDKRVYLGALKYMPHAVLKLLENMPMPWEQIRDVPVLYHITGAISFVNEIPWVIE 137
Cdd:pfam08082    1 KEDMPPEHLRKIIKDHGDMSSKKFRSDKRSYLGALKYMPHAVLKLLENMPMPWEEVREVKVLYHVTGAITFVNEIPRVIE 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|..
gi 115583687   138 PVYISQWGSMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNILDVEPLEAIQLELDPEEDAPVLDWFYD 209
Cdd:pfam08082   81 PVYIAQWSTMWIMMRREKRDRRHFKRMRFPPFDDEEPPLDYADNILDVEPLEAIQMELDEEEDAAVIDWFYD 152
U6-snRNA_bdg pfam10596
U6-snRNA interacting domain of PrP8; This domain incorporates the interacting site for the ...
1442-1600 2.97e-108

U6-snRNA interacting domain of PrP8; This domain incorporates the interacting site for the U6-snRNA as part of the U4/U6.U5 tri-snRNPs complex of the spliceosome, and is the prime candidate for the role of cofactor for the spliceosome's RNA core. The essential spliceosomal protein Prp8 interacts with U5 and U6 snRNAs and with specific pre-mRNA sequences that participate in catalysis. This close association with crucial RNA sequences, together with extensive genetic evidence, suggests that Prp8 could directly affect the function of the catalytic core, perhaps acting as a splicing cofactor.


Pssm-ID: 431383  Cd Length: 159  Bit Score: 341.77  E-value: 2.97e-108
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  1442 FKQYQVLKQNPFWWTHQRHDGKLWNLNNYRTDMIQALGGVEGILEHTLFKGTYFPTWEGLFWEKASGFEESMKWKKLTNA 1521
Cdd:pfam10596    1 FKKYQLLKYNPFWWTHARHDGKLWNLERYRTDIIQALGGVEGILEHTLFKATGFPSWEGLFWDKASGFEESLKYKKLTNA 80
                           90       100       110       120       130       140       150
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....
gi 115583687  1522 QRSGLNQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGIFMHGKIPTLKISLIQIFRAHLWQKIHESIVMDLCQVFDQE 1600
Cdd:pfam10596   81 QRQGLSQIPNRRFTLWWSPTINRANVYVGFQVQLDLTGVFMHGKLPTLKISLIQIFRGHLWQKIHESVVMDLCQVLDQE 159
U5_2-snRNA_bdg pfam10597
U5-snRNA binding site 2 of PrP8; The essential spliceosomal protein Prp8 interacts with U5 and ...
1210-1343 1.78e-89

U5-snRNA binding site 2 of PrP8; The essential spliceosomal protein Prp8 interacts with U5 and U6 snRNAs and with specific pre-mRNA sequences that participate in catalysis. This close association with crucial RNA sequences, together with extensive genetic evidence, suggests that Prp8 could directly affect the function of the catalytic core, perhaps acting as a splicing cofactor.


Pssm-ID: 402297  Cd Length: 134  Bit Score: 286.97  E-value: 1.78e-89
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  1210 KDGVWNLQNEVTKERTAQCFLRVDDESMQRFHNRVRQILMASGSTTFTKIVNKWNTALIGLMTYFREAVVNTQELLDLLV 1289
Cdd:pfam10597    1 NEGVWDLINESTKERTAKAFLQVSEESINNFRNRIRQILMSSGSTTFTKIANKWNTALIALFTYFREAIVSTEELLDILV 80
                           90       100       110       120       130
                   ....*....|....*....|....*....|....*....|....*....|....
gi 115583687  1290 KCENKIQTRIKIGLNSKMPSRFPPVVFYTPKELGGLGMLSMGHVLIPQSDLRWS 1343
Cdd:pfam10597   81 KCETRVQNRVKLGLNSKMPSRFPPAVFYTPKELGGLGMLSAGHVLIPASDLRWS 134
PROCT pfam08084
PROCT (NUC072) domain; The PROCT domain is the C-terminal domain in pre-mRNA splicing factors ...
2212-2321 1.26e-67

PROCT (NUC072) domain; The PROCT domain is the C-terminal domain in pre-mRNA splicing factors of PRO8 family.


Pssm-ID: 462362  Cd Length: 111  Bit Score: 223.19  E-value: 1.26e-67
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  2212 IIITCSFTPGSCTLTAYKLTPSGYEWGRQNTDK-GNNPKGYLPSHYERVQMLLSDRFLGFFMVPAQSSWNYNFMGVRHDP 2290
Cdd:pfam08084    1 ITLTVSFTPGSVSLSAYTLTPEGYEWGRQNKDLiSDNPQGFSPSFSEKVQLLLSDRILGFFLVPEDGVWNYSFMGASFNP 80
                           90       100       110
                   ....*....|....*....|....*....|.
gi 115583687  2291 NMKYELQLANPKEFYHEVHRPSHFLNFALLQ 2321
Cdd:pfam08084   81 NMKYDLKLDIPLEFYDELHRPTHFLNFNELE 111
RRM_4 pfam10598
RNA recognition motif of the spliceosomal PrP8; The large RNA-protein complex of the ...
986-1077 5.09e-53

RNA recognition motif of the spliceosomal PrP8; The large RNA-protein complex of the spliceosome catalyzes pre-mRNA splicing. One of the most conserved core proteins is PrP8 which occupies a central position in the catalytic core of the spliceosome, and has been implicated in several crucial molecular rearrangements that occur there, and has recently come under the spotlight for its role in the inherited human disease, Retinitis Pigmentosa. The RNA-recognition motif of PrP8 is highly conserved and provides a possible RNA binding centre for the 5-prime SS, BP, or 3-prime SS of pre-mRNA which are known to contact with Prp8. The most conserved regions of an RRM are defined as the RNP1 and RNP2 sequences. Recognition of RNA targets can also be modulated by a number of other factors, most notably the two loops beta1-alpha1, beta2-beta3 and the amino acid residues C-terminal to the RNP2 domain.


Pssm-ID: 463163  Cd Length: 92  Bit Score: 180.90  E-value: 5.09e-53
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687   986 EKIDLTLLNRLLRLIVDHNIADYMTAKNNVVINYKDMNHTNSYGIIRGLQFASFIVQYYGLVMDLLVLGLHRASEMAGPP 1065
Cdd:pfam10598    1 EKIDLTLLNRLLRLIMDHNLADYITAKNNVVLTFKDMSHVNSYGLIRGLQFSSFVYQYYGLVLDLLILGLQRASEIAGPP 80
                           90
                   ....*....|..
gi 115583687  1066 QMPNDFLSFQDI 1077
Cdd:pfam10598   81 QMPNEFLQFKDI 92
JAB_MPN smart00232
JAB/MPN domain; Domain in Jun kinase activation domain binding protein and proteasomal ...
2102-2233 1.81e-28

JAB/MPN domain; Domain in Jun kinase activation domain binding protein and proteasomal subunits. Domain at Mpr1p and Pad1p N-termini. Domain of unknown function.


Pssm-ID: 214573  Cd Length: 135  Bit Score: 112.47  E-value: 1.81e-28
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687   2102 YTYILPKNVLKKFIciSDLRAQIAGYLYGVSPPDNPQVKEIRCIVMVPQWGTHQTV-HLPSQLPQHEYLK---EMEPLGW 2177
Cdd:smart00232    4 VHPLVPLNILKHAI--RDGPEEVCGVLLGKSNKDRPEVKEVFAVPNEPQDDSVQEYdEDYSHLMDEELKKvnkDLEIVGW 81
                            90       100       110       120       130
                    ....*....|....*....|....*....|....*....|....*....|....*.
gi 115583687   2178 IHTQPNESPQLSPQDVTTHAKIMADNPSWDGeKTIIITCSFTpGSCTLTAYKLTPS 2233
Cdd:smart00232   82 YHSHPDESPFPSEVDVATHESYQAPWPISVV-LIVDPIKSFQ-GRLSLRAFRLTPE 135
JAB pfam01398
JAB1/Mov34/MPN/PAD-1 ubiquitin protease; Members of this family are found in proteasome ...
2099-2205 5.34e-23

JAB1/Mov34/MPN/PAD-1 ubiquitin protease; Members of this family are found in proteasome regulatory subunits, eukaryotic initiation factor 3 (eIF3) subunits and regulators of transcription factors. This family is also known as the MPN domain and PAD-1-like domain, JABP1 domain or JAMM domain. These are metalloenzymes that function as the ubiquitin isopeptidase/ deubiquitinase in the ubiquitin-based signalling and protein turnover pathways in eukaryotes. Versions of the domain in prokaryotic cognates of the ubiquitin-modification pathway are shown to have a similar role, and the archael protein from Haloferax volcanii is found to cleave ubiquitin-like small archaeal modifier proteins (SAMP1/2) from protein conjugates.


Pssm-ID: 396120  Cd Length: 117  Bit Score: 95.87  E-value: 5.34e-23
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687  2099 ETGYTYILPKNVLKKFICISDLRAQI----AGYLYGVSPPDNpqVKEIRCIVMVPQWGTHQTVHLPSQLPQ-----HEYL 2169
Cdd:pfam01398    1 SSVRTVIIHPLVLLKILDHANRGGKIgeevMGVLLGKLEGDG--TIEITNSFALPQEETEDDVNAVALDQEymenmHEML 78
                           90       100       110       120
                   ....*....|....*....|....*....|....*....|
gi 115583687  2170 KE----MEPLGWIHTQPNEsPQLSPQDVTTHAKIMADNPS 2205
Cdd:pfam01398   79 KKvnrkEEVVGWYHTHPGL-CWLSSVDVHTHALYQRMIPE 117
MPN cd07767
Mpr1p, Pad1p N-terminal (MPN) domains; MPN (also known as Mov34, PAD-1, JAMM, JAB, MPN+) ...
2111-2221 2.34e-22

Mpr1p, Pad1p N-terminal (MPN) domains; MPN (also known as Mov34, PAD-1, JAMM, JAB, MPN+) domains are found in the N-terminal termini of proteins with a variety of functions; they are components of the proteasome regulatory subunits, the signalosome (CSN), eukaryotic translation initiation factor 3 (eIF3) complexes, and regulators of transcription factors. These domains are isopeptidases that release ubiquitin from ubiquitinated proteins (thus having deubiquitinating (DUB) activity) that are tagged for degradation. Catalytically active MPN domains contain a metalloprotease signature known as the JAB1/MPN/Mov34 metalloenzyme (JAMM) motif. For example, Rpn11 (also known as POH1 or PSMD14), a subunit of the 19S proteasome lid is involved in the ATP-dependent degradation of ubiquitinated proteins, contains the conserved JAMM motif involved in zinc ion coordination. Poh1 is a regulator of c-Jun, an important regulator of cell proliferation, differentiation, survival and death. JAB1 is a component of the COP9 signalosome (CSN), a regulatory particle of the ubiquitin (Ub)/26S proteasome system occurring in all eukaryotic cells; it cleaves the ubiquitin-like protein NEDD8 from the cullin subunit of the SCF (Skp1, Cullins, F-box proteins) family of E3 ubiquitin ligases. AMSH (associated molecule with the SH3 domain of STAM, also known as STAMBP), a member of JAMM/MPN+ deubiquitinases (DUBs), specifically cleaves Lys 63-linked polyubiquitin (poly-Ub) chains, thus facilitating the recycling and subsequent trafficking of receptors to the cell surface. Similarly, BRCC36, part of the nuclear complex that includes BRCA1 protein and is targeted to DNA damage foci after irradiation, specifically disassembles K63-linked polyUb. BRCC36 is aberrantly expressed in sporadic breast tumors, indicative of a potential role in the pathogenesis of the disease. Some variants of the JAB1/MPN domains lack key residues in their JAMM motif and are unable to coordinate a metal ion. Comparisons of key catalytic and metal binding residues explain why the MPN-containing proteins Mov34/PSMD7, Rpn8, CSN6, Prp8p, and the translation initiation factor 3 subunits f (p47) and h (p40) do not show catalytic isopeptidase activity. It has been proposed that the MPN domain in these proteins has a primarily structural function.


Pssm-ID: 163686 [Multi-domain]  Cd Length: 116  Bit Score: 94.12  E-value: 2.34e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 115583687 2111 LKKFI--CISDLRAQIAGYLYGVSPPDnpqVKEIRCIVMVPQWGTHQTVHLPS--QLPQHEYLKEMEPLGWIHTQPNESP 2186
Cdd:cd07767     1 LKMFLdaAKSINGKEVIGLLYGSKTKK---VLDVDEVIAVPFDEGDKDDNVWFlmYLDFKKLNAGLRIVGWYHTHPKPSC 77
                          90       100       110
                  ....*....|....*....|....*....|....*
gi 115583687 2187 QLSPQDVTTHAKIMAdnpswDGEKTIIITCSFTPG 2221
Cdd:cd07767    78 FLSPNDLATHELFQR-----YFPEKVMIIVDVKPK 107
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH