NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|49169845|ref|NP_081983|]
View 

nipped-B-like protein isoform a [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
SCC2 cd23958
Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid ...
1257-2467 0e+00

Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid cohesion protein 2 (Scc2) and its homolog (Scc2 homolog, also called Nipped-B-like protein or NIPBL). Scc2/NIPBL and Scc4 form a complex that is responsible for loading the cohesin protein onto sister chromatids during mitosis and meiosis. Cohesin is a ring-shaped protein complex that encircles the sister chromatids and helps to hold them together until they are ready to be separated during cell division. In addition to its role in chromosome segregation, cohesin also plays important roles in other cellular processes such as transcription, chromosome condensation, and DNA repair.


:

Pssm-ID: 467937 [Multi-domain]  Cd Length: 1197  Bit Score: 1439.40  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1257 VKVLNILEKNIQDGSKLStLLNHNNDTEEEERLWRDLIMERVTKSADACLTtinIMTSPNMPKAVYIEDVIERVIQYTKF 1336
Cdd:cd23958    3 VRLLTILERNIRDGESLD-LDLDESQEDDEERLWLLERIDRALEAADASLT---ILTSPGLPKQLYSEDLIERVVDFLKF 78
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1337 HLQNTLYPQYDPVYRVDPHGGGLlssKAKRAKCSTHKQRVIVMLYNKVCDIVSSLSELLEIQLLTDTTILQVSSMGITPF 1416
Cdd:cd23958   79 QLENTIYPAYDPVYRSDSSAKAG---KKKRAKASSKKKKSVSTLLNKLCELLSLLAELLSLQSLTDSVILQLVYLAISPF 155
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1417 FVE----NVSELQLCAIKLVTAVFSRYEKHRQLILEEIFTSLARLPTSKRSLRNFRLNSSDvdgepmYIQMVTALVLQLI 1492
Cdd:cd23958  156 FVEnavsNVDELQLSALKLLTSIFSRYPDQRQFIIEEILSSLAKLPSSKRNLRQFRLNDGK------SIQMVTALLLQLV 229
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1493 QCVVHLPSSEKDPNSEEDSNKKVDQ-----DVVITNSYETAMRTAQNFLSIFLKKCGSK--QGEEDYRPLFENFVQDLLS 1565
Cdd:cd23958  230 QSSVKLPNLEKESSRDKSLEEDSDElledeESALAKSYESAVRIASYFLSFLLQKCTKKkkEKDTDYRPLFENFVQDLLT 309
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1566 TVNKPEWPAAELLLSLLGRLLVHQFSNKSTEMALRVASLDYLGTVAARLRKDAVTskmdqgsierilkqvsggedeiQQL 1645
Cdd:cd23958  310 VLNLPEWPAAELLLSLLGRLLVSIFSNKKTDANARVMALDLLGLIAARLRKDALA----------------------EEL 367
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1646 QKALLDYLDENTETDPSLVFSRKFYIAQWFRDTTLETEKAMKSQKDEESSDathhakelettgqIMHRAENRKKFLRSIi 1725
Cdd:cd23958  368 QKALLDYLAENSSSDPSLESARGFYLAQWLRDLSNELEKAEKAAEEEDTIL-------------KLELSELRKKFLDSK- 433
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1726 kttpsqFSTLKMNSDTVDYDDACLIVRYLASMRPFAQSFDIYLTQILRVLGENAIAVRTKAMKCLSEVVAVDPSILARLD 1805
Cdd:cd23958  434 ------ILSKEEEASPLSREDAKLLYRALASQRPLSQSFDPILKQLLSSLDEPAVTLRTKALKALSLVVEADPSILGDPD 507
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1806 MQRGVHGRLMDNSTSVREAAVELLGRFVLCRPQLAEQYYDMLIERILDTGISVRKRVIKILRDICIEQPTFPKITEMCVK 1885
Cdd:cd23958  508 VQRAVEGRLLDSSASVREAAVELVGKYISSRPDLAEQYYEMIAERILDTGVSVRKRVIKILRDIYLRTPDFEIKVDICVR 587
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1886 MIRRVND-EEGIKKLVNETFQKLWFTPTPHN-----DKEAMTRKILNITDVVAACRdTGYDWFEQLLQNLLKSEEDSSYK 1959
Cdd:cd23958  588 LLRRINDeEESIKDLARKTFQELWFTPFPESsspaqDKESLAERVLLIVDVVAACR-KGLDLLEQLLKRLLKSKEDKEDK 666
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1960 PVKKACTQLVDNLVEHILKYEESLADSdnkgvNSGRLVACITTLFLFSKIRP-QLMVKHAMTMQPYLTTKCSTQNDFMVI 2038
Cdd:cd23958  667 SVRKACKQLVDCLVELILELEEDDDES-----SESDLVACLSTLHLFAKADPkLLLVEHAETLQPYLKSKCSTREDQQVL 741
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 2039 CNVAKILELVVPLMEHPSETFLATIEEDLMKLIIKYGMTVVQHCVSCLGAVVNKVTQNFKFVWACFNRYYGAISKLKSQH 2118
Cdd:cd23958  742 RYVLRILRSVLPLLSHPSESFLEELEEDLLKLLLKHSVTVLQEAIACLCAVVNKLTKNYERLRKALQSCLKLLRKYKRQA 821
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 2119 QEDPNNTsllTNKPALLRSLFTVGALCRHFDFDLEDFKGN-----SKVNIKDKVLELLMYFTKHS-DEEVQTKAIIGLGF 2192
Cdd:cd23958  822 NLDPSSL---KEDPKLLRLLYILGLLARYCDFDSERDDFEkaplkTKESVKELVFDLLLFFTKPPiDEDVRKKALQALGF 898
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 2193 AFIQHPSLMFEQEVKNLYNSILSDknSSVNLKIQVLKNLQTYLQEEDTRMQQADRDWKKVAKQ-----EDLKEMGDVSSG 2267
Cdd:cd23958  899 LCIAHPKLFLSPEVLKLLDEILAS--GSLKLKLQVLRNLQEFLQAEEKRMEAADAEWKKNSKAadvkvLDGKEMGDADSG 976
                       1050      1060      1070      1080      1090      1100      1110      1120
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 2268 MSSSIMQLYLKQVLEAFFHTQSSVRHFALNVIALTLNQGLIHPVQCVPYLIAMGTDPEPAMRNKADQQLVEIDKKYAGFI 2347
Cdd:cd23958  977 VASSIMQRYLKDILELCLSSDSQVRLAALKVLELILRQGLVHPIQCVPTLIALETDPNPAIRKLALRLLKELHEKYESLV 1056
                       1130      1140      1150      1160      1170      1180      1190      1200
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 2348 HMKAVAGMKMSYQVQQAINTclKDPVRGFRQDESSSALCSHLYSMIRGNRQHRRAFLISLLNLFDD------TAKTEVTM 2421
Cdd:cd23958 1057 ESKYLEGVRLAFQYQKRLAG--DTRGRGFRTDSPPTALLGRLYSLLRGNRKSRRKFLKSLLKLFDFdlkkssDSPSDLDF 1134
                       1210      1220      1230      1240
                 ....*....|....*....|....*....|....*....|....*..
gi 49169845 2422 LLYIADNLACFPYQTQEEPLFIMHHIDITLSVSGSNLLQSF-KESMV 2467
Cdd:cd23958 1135 LLFLAENLAFLPYQTQDEPLFVIHTIDRILSVTGSSLLQAIaKASQA 1181
PspC_subgroup_2 super family cl41463
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
597-830 6.50e-15

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


The actual alignment was detected with superfamily member NF033839:

Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 80.58  E-value: 6.50e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   597 PETPKKKSDPELSKSEMKQNESRLSESKPNENQLgESKSNESKLETKTETPTEELKQNENKTTESKQSESAVVEPKQNEN 676
Cdd:NF033839  281 QDTPKEPGNKKPSAPKPGMQPSPQPEKKEVKPEP-ETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPE 359
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   677 RPcdtKPNDNKQNNTRSENTKARPETPKQKAESRPETPKQKSEGRPETPKQKGDGRPETPKQKSEGRPETPKQKGEGRPE 756
Cdd:NF033839  360 KP---KPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPE 436
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   757 TPK---------HRHENRRDSGKPSTEKKPDVSKHKQDIKsdsPRLKSERAEALKQRPDGRWESLRRD-HDSKQKSDDRG 826
Cdd:NF033839  437 KPKpevkpqpekPKPEVKPQPETPKPEVKPQPEKPKPEVK---PQPEKPKPDNSKPQADDKKPSTPNNlSKDKQPSNQAS 513

                  ....
gi 49169845   827 ESER 830
Cdd:NF033839  514 TNEK 517
PTZ00121 super family cl31754
MAEBL; Provisional
476-1099 6.34e-10

MAEBL; Provisional


The actual alignment was detected with superfamily member PTZ00121:

Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 65.55  E-value: 6.34e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   476 EIERIERESAIERERFSKEVQDKDKPLKKRKQDSYPQEAggatggnrPASQETGSTGNGSRPALMVSIDLHQAGRVdSQA 555
Cdd:PTZ00121 1216 EARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEI--------RKFEEARMAHFARRQAAIKAEEARKADEL-KKA 1286
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   556 SITQDSDSIKKPEETKQCNDAPISVLQEDIVGSLKSIPENH---PETPKKKSDP-----ELSKSEMKQNESRLSESKpNE 627
Cdd:PTZ00121 1287 EEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAkkkADAAKKKAEEakkaaEAAKAEAEAAADEAEAAE-EK 1365
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   628 NQLGESKSNESKletkteTPTEELKQnenKTTESKQSESAVVEPKQNENRPCDTKpndnkqnntRSENTKARPETPKQKA 707
Cdd:PTZ00121 1366 AEAAEKKKEEAK------KKADAAKK---KAEEKKKADEAKKKAEEDKKKADELK---------KAAAAKKKADEAKKKA 1427
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   708 ES--RPETPKQKSEgrpetPKQKGDgrpETPKQKSEGR-PETPKQKGE--GRPETPKHRHENRR--DSGKPSTE---KKP 777
Cdd:PTZ00121 1428 EEkkKADEAKKKAE-----EAKKAD---EAKKKAEEAKkAEEAKKKAEeaKKADEAKKKAEEAKkaDEAKKKAEeakKKA 1499
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   778 DVSKHKQDIKSDSPRLK----SERAEALKQRPDGRWESLRRDHDSKQKSDDRGESERHRGDQSRVRRPETLRSSSRNDHS 853
Cdd:PTZ00121 1500 DEAKKAAEAKKKADEAKkaeeAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMA 1579
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   854 TK--SDGSKTEKLERKHRHESGDSRDRPSGEQKSRPDSPRVKQGDTNKSRPGFKSPNSKDDKRTEGNRSKVDSNKAHTDN 931
Cdd:PTZ00121 1580 LRkaEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEEN 1659
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   932 KAefpsyllggRSGALKNFVIPKIKRDKDGNITQETKK-----MDMKGEQKDKVEKM-GLVEDLNKGAKPVVVLQKLSLD 1005
Cdd:PTZ00121 1660 KI---------KAAEEAKKAEEDKKKAEEAKKAEEDEKkaaeaLKKEAEEAKKAEELkKKEAEEKKKAEELKKAEEENKI 1730
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845  1006 DVQKLIKDREEKSRsslKSIKNKPSKSNKGSIDQSVLKELPPELLAEIESTMPLCERVKMNKRKRSTVNEKPKYAEISSD 1085
Cdd:PTZ00121 1731 KAEEAKKEAEEDKK---KAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNF 1807
                         650       660
                  ....*....|....*....|....*.
gi 49169845  1086 ED------------NDSDEAFESSRK 1099
Cdd:PTZ00121 1808 ANiieggkegnlviNDSKEMEDSAIK 1833
 
Name Accession Description Interval E-value
SCC2 cd23958
Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid ...
1257-2467 0e+00

Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid cohesion protein 2 (Scc2) and its homolog (Scc2 homolog, also called Nipped-B-like protein or NIPBL). Scc2/NIPBL and Scc4 form a complex that is responsible for loading the cohesin protein onto sister chromatids during mitosis and meiosis. Cohesin is a ring-shaped protein complex that encircles the sister chromatids and helps to hold them together until they are ready to be separated during cell division. In addition to its role in chromosome segregation, cohesin also plays important roles in other cellular processes such as transcription, chromosome condensation, and DNA repair.


Pssm-ID: 467937 [Multi-domain]  Cd Length: 1197  Bit Score: 1439.40  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1257 VKVLNILEKNIQDGSKLStLLNHNNDTEEEERLWRDLIMERVTKSADACLTtinIMTSPNMPKAVYIEDVIERVIQYTKF 1336
Cdd:cd23958    3 VRLLTILERNIRDGESLD-LDLDESQEDDEERLWLLERIDRALEAADASLT---ILTSPGLPKQLYSEDLIERVVDFLKF 78
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1337 HLQNTLYPQYDPVYRVDPHGGGLlssKAKRAKCSTHKQRVIVMLYNKVCDIVSSLSELLEIQLLTDTTILQVSSMGITPF 1416
Cdd:cd23958   79 QLENTIYPAYDPVYRSDSSAKAG---KKKRAKASSKKKKSVSTLLNKLCELLSLLAELLSLQSLTDSVILQLVYLAISPF 155
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1417 FVE----NVSELQLCAIKLVTAVFSRYEKHRQLILEEIFTSLARLPTSKRSLRNFRLNSSDvdgepmYIQMVTALVLQLI 1492
Cdd:cd23958  156 FVEnavsNVDELQLSALKLLTSIFSRYPDQRQFIIEEILSSLAKLPSSKRNLRQFRLNDGK------SIQMVTALLLQLV 229
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1493 QCVVHLPSSEKDPNSEEDSNKKVDQ-----DVVITNSYETAMRTAQNFLSIFLKKCGSK--QGEEDYRPLFENFVQDLLS 1565
Cdd:cd23958  230 QSSVKLPNLEKESSRDKSLEEDSDElledeESALAKSYESAVRIASYFLSFLLQKCTKKkkEKDTDYRPLFENFVQDLLT 309
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1566 TVNKPEWPAAELLLSLLGRLLVHQFSNKSTEMALRVASLDYLGTVAARLRKDAVTskmdqgsierilkqvsggedeiQQL 1645
Cdd:cd23958  310 VLNLPEWPAAELLLSLLGRLLVSIFSNKKTDANARVMALDLLGLIAARLRKDALA----------------------EEL 367
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1646 QKALLDYLDENTETDPSLVFSRKFYIAQWFRDTTLETEKAMKSQKDEESSDathhakelettgqIMHRAENRKKFLRSIi 1725
Cdd:cd23958  368 QKALLDYLAENSSSDPSLESARGFYLAQWLRDLSNELEKAEKAAEEEDTIL-------------KLELSELRKKFLDSK- 433
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1726 kttpsqFSTLKMNSDTVDYDDACLIVRYLASMRPFAQSFDIYLTQILRVLGENAIAVRTKAMKCLSEVVAVDPSILARLD 1805
Cdd:cd23958  434 ------ILSKEEEASPLSREDAKLLYRALASQRPLSQSFDPILKQLLSSLDEPAVTLRTKALKALSLVVEADPSILGDPD 507
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1806 MQRGVHGRLMDNSTSVREAAVELLGRFVLCRPQLAEQYYDMLIERILDTGISVRKRVIKILRDICIEQPTFPKITEMCVK 1885
Cdd:cd23958  508 VQRAVEGRLLDSSASVREAAVELVGKYISSRPDLAEQYYEMIAERILDTGVSVRKRVIKILRDIYLRTPDFEIKVDICVR 587
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1886 MIRRVND-EEGIKKLVNETFQKLWFTPTPHN-----DKEAMTRKILNITDVVAACRdTGYDWFEQLLQNLLKSEEDSSYK 1959
Cdd:cd23958  588 LLRRINDeEESIKDLARKTFQELWFTPFPESsspaqDKESLAERVLLIVDVVAACR-KGLDLLEQLLKRLLKSKEDKEDK 666
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1960 PVKKACTQLVDNLVEHILKYEESLADSdnkgvNSGRLVACITTLFLFSKIRP-QLMVKHAMTMQPYLTTKCSTQNDFMVI 2038
Cdd:cd23958  667 SVRKACKQLVDCLVELILELEEDDDES-----SESDLVACLSTLHLFAKADPkLLLVEHAETLQPYLKSKCSTREDQQVL 741
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 2039 CNVAKILELVVPLMEHPSETFLATIEEDLMKLIIKYGMTVVQHCVSCLGAVVNKVTQNFKFVWACFNRYYGAISKLKSQH 2118
Cdd:cd23958  742 RYVLRILRSVLPLLSHPSESFLEELEEDLLKLLLKHSVTVLQEAIACLCAVVNKLTKNYERLRKALQSCLKLLRKYKRQA 821
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 2119 QEDPNNTsllTNKPALLRSLFTVGALCRHFDFDLEDFKGN-----SKVNIKDKVLELLMYFTKHS-DEEVQTKAIIGLGF 2192
Cdd:cd23958  822 NLDPSSL---KEDPKLLRLLYILGLLARYCDFDSERDDFEkaplkTKESVKELVFDLLLFFTKPPiDEDVRKKALQALGF 898
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 2193 AFIQHPSLMFEQEVKNLYNSILSDknSSVNLKIQVLKNLQTYLQEEDTRMQQADRDWKKVAKQ-----EDLKEMGDVSSG 2267
Cdd:cd23958  899 LCIAHPKLFLSPEVLKLLDEILAS--GSLKLKLQVLRNLQEFLQAEEKRMEAADAEWKKNSKAadvkvLDGKEMGDADSG 976
                       1050      1060      1070      1080      1090      1100      1110      1120
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 2268 MSSSIMQLYLKQVLEAFFHTQSSVRHFALNVIALTLNQGLIHPVQCVPYLIAMGTDPEPAMRNKADQQLVEIDKKYAGFI 2347
Cdd:cd23958  977 VASSIMQRYLKDILELCLSSDSQVRLAALKVLELILRQGLVHPIQCVPTLIALETDPNPAIRKLALRLLKELHEKYESLV 1056
                       1130      1140      1150      1160      1170      1180      1190      1200
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 2348 HMKAVAGMKMSYQVQQAINTclKDPVRGFRQDESSSALCSHLYSMIRGNRQHRRAFLISLLNLFDD------TAKTEVTM 2421
Cdd:cd23958 1057 ESKYLEGVRLAFQYQKRLAG--DTRGRGFRTDSPPTALLGRLYSLLRGNRKSRRKFLKSLLKLFDFdlkkssDSPSDLDF 1134
                       1210      1220      1230      1240
                 ....*....|....*....|....*....|....*....|....*..
gi 49169845 2422 LLYIADNLACFPYQTQEEPLFIMHHIDITLSVSGSNLLQSF-KESMV 2467
Cdd:cd23958 1135 LLFLAENLAFLPYQTQDEPLFVIHTIDRILSVTGSSLLQAIaKASQA 1181
Nipped-B_C pfam12830
Sister chromatid cohesion C-terminus; This domain lies towards the C-terminus of nipped-B or ...
2269-2450 1.20e-69

Sister chromatid cohesion C-terminus; This domain lies towards the C-terminus of nipped-B or sister chromatid cohesion proteins.


Pssm-ID: 463722  Cd Length: 180  Bit Score: 232.43  E-value: 1.20e-69
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   2269 SSSIMQLYLKQVLEAFFHTQSSVRHFALNVIALTLNQGLIHPVQCVPYLIAMGTDPEPAMRNKADQQLVEIDKKYAGFIH 2348
Cdd:pfam12830    1 CSALVQRYLKHILEICLSSDDQVRLLALEVLALILRQGLVHPKECIPTLIALETSPNPYIRKLAFELHKELHEKHESLLE 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   2349 MKAVAGMKMSYQVQQAINTClkdpvrgfRQDESSSALCSHLYSMIRGNRQHRRAFLISLLNLFDD------TAKTEVTML 2422
Cdd:pfam12830   81 SRYMEGIRLAFEYQRRVLSG--------ATLEPPTSFLSLLYSLLRSNKKSRKKFLKSLVKLFFDldlsseSSPSDLDFL 152
                          170       180
                   ....*....|....*....|....*...
gi 49169845   2423 LYIADNLACFPYQTQEEPLFIMHHIDIT 2450
Cdd:pfam12830  153 RFLAENLAFLPYQTQDEVLFLIHHIDRI 180
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
597-830 6.50e-15

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 80.58  E-value: 6.50e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   597 PETPKKKSDPELSKSEMKQNESRLSESKPNENQLgESKSNESKLETKTETPTEELKQNENKTTESKQSESAVVEPKQNEN 676
Cdd:NF033839  281 QDTPKEPGNKKPSAPKPGMQPSPQPEKKEVKPEP-ETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPE 359
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   677 RPcdtKPNDNKQNNTRSENTKARPETPKQKAESRPETPKQKSEGRPETPKQKGDGRPETPKQKSEGRPETPKQKGEGRPE 756
Cdd:NF033839  360 KP---KPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPE 436
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   757 TPK---------HRHENRRDSGKPSTEKKPDVSKHKQDIKsdsPRLKSERAEALKQRPDGRWESLRRD-HDSKQKSDDRG 826
Cdd:NF033839  437 KPKpevkpqpekPKPEVKPQPETPKPEVKPQPEKPKPEVK---PQPEKPKPDNSKPQADDKKPSTPNNlSKDKQPSNQAS 513

                  ....
gi 49169845   827 ESER 830
Cdd:NF033839  514 TNEK 517
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
551-933 1.09e-14

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 80.20  E-value: 1.09e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   551 VDSQASITQDSDSIKKPEETKQCNDAPISVLQEDIVGSLKSIPENHPETPKKKSDPELSKSEMKQNESRLSESKPNENQL 630
Cdd:NF033839  151 SSSGSSTKPETPQPENPEHQKPTTPAPDTKPSPQPEGKKPSVPDINQEKEKAKLAVATYMSKILDDIQKHHLQKEKHRQI 230
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   631 GESKSNESKLETKTETPTEELKQN---ENKTTESKQSESAVVEPKQNENRPCDTKPNDNKQNNTRSENTKARPETPKQKA 707
Cdd:NF033839  231 VALIKELDELKKQALSEIDNVNTKveiENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKKPSAPKPGMQPSPQPEKKEV 310
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   708 ESRPETPKQKSEGRPETPKQKGDGRPETPKQKSEGRPETPKQKGEGRPETPK---------HRHENRRDSGKPSTEKKPD 778
Cdd:NF033839  311 KPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKpevkpqpekPKPEVKPQPETPKPEVKPQ 390
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   779 VSKHKQDIKSDSPRLKSEraeaLKQRPDGRWESLRRDHDsKQKSDDRGESERHRGDQSrvRRPETLRSSSRNDhstksdg 858
Cdd:NF033839  391 PEKPKPEVKPQPEKPKPE----VKPQPEKPKPEVKPQPE-KPKPEVKPQPEKPKPEVK--PQPEKPKPEVKPQ------- 456
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 49169845   859 SKTEKLERKHRHESgdsrdrPSGEQKSRPDSPRvkqGDTNKSRPGFKSPNSKDdkrtEGNRSKVDSNKAHTDNKA 933
Cdd:NF033839  457 PETPKPEVKPQPEK------PKPEVKPQPEKPK---PDNSKPQADDKKPSTPN----NLSKDKQPSNQASTNEKA 518
PTZ00121 PTZ00121
MAEBL; Provisional
476-1099 6.34e-10

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 65.55  E-value: 6.34e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   476 EIERIERESAIERERFSKEVQDKDKPLKKRKQDSYPQEAggatggnrPASQETGSTGNGSRPALMVSIDLHQAGRVdSQA 555
Cdd:PTZ00121 1216 EARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEI--------RKFEEARMAHFARRQAAIKAEEARKADEL-KKA 1286
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   556 SITQDSDSIKKPEETKQCNDAPISVLQEDIVGSLKSIPENH---PETPKKKSDP-----ELSKSEMKQNESRLSESKpNE 627
Cdd:PTZ00121 1287 EEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAkkkADAAKKKAEEakkaaEAAKAEAEAAADEAEAAE-EK 1365
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   628 NQLGESKSNESKletkteTPTEELKQnenKTTESKQSESAVVEPKQNENRPCDTKpndnkqnntRSENTKARPETPKQKA 707
Cdd:PTZ00121 1366 AEAAEKKKEEAK------KKADAAKK---KAEEKKKADEAKKKAEEDKKKADELK---------KAAAAKKKADEAKKKA 1427
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   708 ES--RPETPKQKSEgrpetPKQKGDgrpETPKQKSEGR-PETPKQKGE--GRPETPKHRHENRR--DSGKPSTE---KKP 777
Cdd:PTZ00121 1428 EEkkKADEAKKKAE-----EAKKAD---EAKKKAEEAKkAEEAKKKAEeaKKADEAKKKAEEAKkaDEAKKKAEeakKKA 1499
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   778 DVSKHKQDIKSDSPRLK----SERAEALKQRPDGRWESLRRDHDSKQKSDDRGESERHRGDQSRVRRPETLRSSSRNDHS 853
Cdd:PTZ00121 1500 DEAKKAAEAKKKADEAKkaeeAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMA 1579
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   854 TK--SDGSKTEKLERKHRHESGDSRDRPSGEQKSRPDSPRVKQGDTNKSRPGFKSPNSKDDKRTEGNRSKVDSNKAHTDN 931
Cdd:PTZ00121 1580 LRkaEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEEN 1659
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   932 KAefpsyllggRSGALKNFVIPKIKRDKDGNITQETKK-----MDMKGEQKDKVEKM-GLVEDLNKGAKPVVVLQKLSLD 1005
Cdd:PTZ00121 1660 KI---------KAAEEAKKAEEDKKKAEEAKKAEEDEKkaaeaLKKEAEEAKKAEELkKKEAEEKKKAEELKKAEEENKI 1730
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845  1006 DVQKLIKDREEKSRsslKSIKNKPSKSNKGSIDQSVLKELPPELLAEIESTMPLCERVKMNKRKRSTVNEKPKYAEISSD 1085
Cdd:PTZ00121 1731 KAEEAKKEAEEDKK---KAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNF 1807
                         650       660
                  ....*....|....*....|....*.
gi 49169845  1086 ED------------NDSDEAFESSRK 1099
Cdd:PTZ00121 1808 ANiieggkegnlviNDSKEMEDSAIK 1833
PRK12678 PRK12678
transcription termination factor Rho; Provisional
693-919 1.55e-07

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 57.22  E-value: 1.55e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   693 SENTKARPETPKQKAESRPETPKQKSEGRPETPKQKGDGRPETPKQKSEGRPETPKQKGEGRPETPKHRHENRRDSGKPS 772
Cdd:PRK12678   67 AATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAG 146
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   773 TEKKPDVSKHKQDIKSDSPRLKSERAEALKQRPDgrweslRRDHDSKQKSDDRGESERHRGDQSRVRRpetlrsssRNDH 852
Cdd:PRK12678  147 EGGEQPATEARADAAERTEEEERDERRRRGDRED------RQAEAERGERGRREERGRDGDDRDRRDR--------REQG 212
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 49169845   853 STKSDGSKTEKLERKHRHESGDSRDRPSGEQKSRPDSPRVKQGDTNKSRPGFKSpNSKDDKRTEGNR 919
Cdd:PRK12678  213 DRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRF-RDRDRRGRRGGD 278
ftsN TIGR02223
cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a ...
592-777 6.34e-07

cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a number of Proteobacteria. The N-terminal 30 residue region tends to by Lys/Arg-rich, and is followed by a membrane-spanning region. This is followed by an acidic low-complexity region of variable length and a well-conserved C-terminal domain of two tandem regions matched by pfam05036 (Sporulation related repeat), found in several cell division and sporulation proteins. The role of FtsN as a suppressor for other cell division mutations is poorly understood; it may involve cell wall hydrolysis. [Cellular processes, Cell division]


Pssm-ID: 274041 [Multi-domain]  Cd Length: 298  Bit Score: 53.93  E-value: 6.34e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845    592 IPENHPEtpkkKSDPELSKSEMKQNESRLSESKPN--ENQLGESKSNESKLETKTETPTEELKQNENKTTESKQSESAVV 669
Cdd:TIGR02223   47 LLTESKQ----ANEPETLQPKNQTENGETAADLPPkpEERWSYIEELEAREVLINDPEEPSNGGGVEESAQLTAEQRQLL 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845    670 EPKQNENRPCDtkpndnKQNNTRSENTKARPETPKQKAESRPETPKQKSEGRPETPKQKGDGRPET--PKQKSEGRPETP 747
Cdd:TIGR02223  123 EQMQADMRAAE------KVLATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTPVETEKIASKVKEakQKQKALPKQTAE 196
                          170       180       190
                   ....*....|....*....|....*....|
gi 49169845    748 KQKGEGRPETPkhRHENRRDSGKPSTEKKP 777
Cdd:TIGR02223  197 TQSNSKPIETA--PKADKADKTKPKPKEKA 224
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
802-919 1.61e-03

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 43.75  E-value: 1.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845    802 KQRPDGRWESLRRDHDSKQKSD-DRgesERHRgDQSRVRRpetlRSSSRnDHSTKSDGSKTEKLERKHRHESGDSRDRPS 880
Cdd:TIGR01622    3 RDRERERLRDSSSAGDRDRRRDkGR---ERSR-DRSRDRE----RSRSR-RRDRHRDRDYYRGRERRSRSRRPNRRYRPR 73
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 49169845    881 GEQKSRPDSPRVKQGDTNKSRPGFKSPNSKDDKRTEGNR 919
Cdd:TIGR01622   74 EKRRRRGDSYRRRRDDRRSRREKPRARDGTPEPLTEDER 112
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
556-930 1.94e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 43.85  E-value: 1.94e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   556 SITQDSDSIKKPEETKQCNDAPISVLQEDIVGSLKSIPENHPETPKKKSDPELSKSEMKQN---------ESRLSES--- 623
Cdd:NF033838  105 NVLKEKSEAELTSKTKKELDAAFEQFKKDTLEPGKKVAEATKKVEEAEKKAKDQKEEDRRNyptntyktlELEIAESdve 184
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   624 -KPNENQLGESKSNESKletktetPTEELKQNENKTtESKQSESAVVE----PKQNENRPCDTKPNDNKQNNTRSENTKA 698
Cdd:NF033838  185 vKKAELELVKEEAKEPR-------DEEKIKQAKAKV-ESKKAEATRLEkiktDREKAEEEAKRRADAKLKEAVEKNVATS 256
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   699 RPETPKQKAE----SRPETPKQKSEGRPETPKQKGDGRPETPKQKSEGR-PETPKQKGEGRPETPKHRHENRRDSgKPST 773
Cdd:NF033838  257 EQDKPKRRAKrgvlGEPATPDKKENDAKSSDSSVGEETLPSPSLKPEKKvAEAEKKVEEAKKKAKDQKEEDRRNY-PTNT 335
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   774 EKKPDVSKHKQDIKSDSPRLKSERAEALKQRPDGRWESLRRDHDSKQKSDDRGEserhrgdqsrvrrpetlrsssrndhS 853
Cdd:NF033838  336 YKTLELEIAESDVKVKEAELELVKEEAKEPRNEEKIKQAKAKVESKKAEATRLE-------------------------K 390
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 49169845   854 TKSDGSKTEKLERKHRHESGDSRDRPSgEQKSRPDSPRVKqgdtnksRPGFKSPNSKDDKRTEgnrsKVDSNKAHTD 930
Cdd:NF033838  391 IKTDRKKAEEEAKRKAAEEDKVKEKPA-EQPQPAPAPQPE-------KPAPKPEKPAEQPKAE----KPADQQAEED 455
IHABP4_N pfam16174
Intracellular hyaluronan-binding protein 4 N-terminal; IHABP4_N is the N-terminal region of ...
651-755 9.96e-03

Intracellular hyaluronan-binding protein 4 N-terminal; IHABP4_N is the N-terminal region of intracellular hyaluronan-binding protein 4-like and SERPINE1 mRNA binding protein 1-like proteins. This region carries nuclear localization sites, and may also be involved in the binding to some of the partners in the translational machinery.


Pssm-ID: 465041 [Multi-domain]  Cd Length: 140  Bit Score: 39.01  E-value: 9.96e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845    651 LKQNENKTTESKQSESAVVEPKQNENRPCDTKPNDNKqNNTRSENTKARPETPKQKAESRPETPKQKSEGRPETPKqkgd 730
Cdd:pfam16174   27 LREAENKKKLQLKIKFGEKANKGKGAGPRGGPRGGQA-RIGRKESQKDRKNDVASGQPDKPGGGGNDEPGERRPPR---- 101
                           90       100
                   ....*....|....*....|....*.
gi 49169845    731 gRPETPKQKsEGRPETPKQ-KGEGRP 755
Cdd:pfam16174  102 -RGEQQGQG-EGRRTPPRReRGERRP 125
 
Name Accession Description Interval E-value
SCC2 cd23958
Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid ...
1257-2467 0e+00

Sister chromatid cohesion protein 2 and homologs; This family includes Sister chromatid cohesion protein 2 (Scc2) and its homolog (Scc2 homolog, also called Nipped-B-like protein or NIPBL). Scc2/NIPBL and Scc4 form a complex that is responsible for loading the cohesin protein onto sister chromatids during mitosis and meiosis. Cohesin is a ring-shaped protein complex that encircles the sister chromatids and helps to hold them together until they are ready to be separated during cell division. In addition to its role in chromosome segregation, cohesin also plays important roles in other cellular processes such as transcription, chromosome condensation, and DNA repair.


Pssm-ID: 467937 [Multi-domain]  Cd Length: 1197  Bit Score: 1439.40  E-value: 0e+00
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1257 VKVLNILEKNIQDGSKLStLLNHNNDTEEEERLWRDLIMERVTKSADACLTtinIMTSPNMPKAVYIEDVIERVIQYTKF 1336
Cdd:cd23958    3 VRLLTILERNIRDGESLD-LDLDESQEDDEERLWLLERIDRALEAADASLT---ILTSPGLPKQLYSEDLIERVVDFLKF 78
                         90       100       110       120       130       140       150       160
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1337 HLQNTLYPQYDPVYRVDPHGGGLlssKAKRAKCSTHKQRVIVMLYNKVCDIVSSLSELLEIQLLTDTTILQVSSMGITPF 1416
Cdd:cd23958   79 QLENTIYPAYDPVYRSDSSAKAG---KKKRAKASSKKKKSVSTLLNKLCELLSLLAELLSLQSLTDSVILQLVYLAISPF 155
                        170       180       190       200       210       220       230       240
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1417 FVE----NVSELQLCAIKLVTAVFSRYEKHRQLILEEIFTSLARLPTSKRSLRNFRLNSSDvdgepmYIQMVTALVLQLI 1492
Cdd:cd23958  156 FVEnavsNVDELQLSALKLLTSIFSRYPDQRQFIIEEILSSLAKLPSSKRNLRQFRLNDGK------SIQMVTALLLQLV 229
                        250       260       270       280       290       300       310       320
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1493 QCVVHLPSSEKDPNSEEDSNKKVDQ-----DVVITNSYETAMRTAQNFLSIFLKKCGSK--QGEEDYRPLFENFVQDLLS 1565
Cdd:cd23958  230 QSSVKLPNLEKESSRDKSLEEDSDElledeESALAKSYESAVRIASYFLSFLLQKCTKKkkEKDTDYRPLFENFVQDLLT 309
                        330       340       350       360       370       380       390       400
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1566 TVNKPEWPAAELLLSLLGRLLVHQFSNKSTEMALRVASLDYLGTVAARLRKDAVTskmdqgsierilkqvsggedeiQQL 1645
Cdd:cd23958  310 VLNLPEWPAAELLLSLLGRLLVSIFSNKKTDANARVMALDLLGLIAARLRKDALA----------------------EEL 367
                        410       420       430       440       450       460       470       480
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1646 QKALLDYLDENTETDPSLVFSRKFYIAQWFRDTTLETEKAMKSQKDEESSDathhakelettgqIMHRAENRKKFLRSIi 1725
Cdd:cd23958  368 QKALLDYLAENSSSDPSLESARGFYLAQWLRDLSNELEKAEKAAEEEDTIL-------------KLELSELRKKFLDSK- 433
                        490       500       510       520       530       540       550       560
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1726 kttpsqFSTLKMNSDTVDYDDACLIVRYLASMRPFAQSFDIYLTQILRVLGENAIAVRTKAMKCLSEVVAVDPSILARLD 1805
Cdd:cd23958  434 ------ILSKEEEASPLSREDAKLLYRALASQRPLSQSFDPILKQLLSSLDEPAVTLRTKALKALSLVVEADPSILGDPD 507
                        570       580       590       600       610       620       630       640
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1806 MQRGVHGRLMDNSTSVREAAVELLGRFVLCRPQLAEQYYDMLIERILDTGISVRKRVIKILRDICIEQPTFPKITEMCVK 1885
Cdd:cd23958  508 VQRAVEGRLLDSSASVREAAVELVGKYISSRPDLAEQYYEMIAERILDTGVSVRKRVIKILRDIYLRTPDFEIKVDICVR 587
                        650       660       670       680       690       700       710       720
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1886 MIRRVND-EEGIKKLVNETFQKLWFTPTPHN-----DKEAMTRKILNITDVVAACRdTGYDWFEQLLQNLLKSEEDSSYK 1959
Cdd:cd23958  588 LLRRINDeEESIKDLARKTFQELWFTPFPESsspaqDKESLAERVLLIVDVVAACR-KGLDLLEQLLKRLLKSKEDKEDK 666
                        730       740       750       760       770       780       790       800
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1960 PVKKACTQLVDNLVEHILKYEESLADSdnkgvNSGRLVACITTLFLFSKIRP-QLMVKHAMTMQPYLTTKCSTQNDFMVI 2038
Cdd:cd23958  667 SVRKACKQLVDCLVELILELEEDDDES-----SESDLVACLSTLHLFAKADPkLLLVEHAETLQPYLKSKCSTREDQQVL 741
                        810       820       830       840       850       860       870       880
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 2039 CNVAKILELVVPLMEHPSETFLATIEEDLMKLIIKYGMTVVQHCVSCLGAVVNKVTQNFKFVWACFNRYYGAISKLKSQH 2118
Cdd:cd23958  742 RYVLRILRSVLPLLSHPSESFLEELEEDLLKLLLKHSVTVLQEAIACLCAVVNKLTKNYERLRKALQSCLKLLRKYKRQA 821
                        890       900       910       920       930       940       950       960
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 2119 QEDPNNTsllTNKPALLRSLFTVGALCRHFDFDLEDFKGN-----SKVNIKDKVLELLMYFTKHS-DEEVQTKAIIGLGF 2192
Cdd:cd23958  822 NLDPSSL---KEDPKLLRLLYILGLLARYCDFDSERDDFEkaplkTKESVKELVFDLLLFFTKPPiDEDVRKKALQALGF 898
                        970       980       990      1000      1010      1020      1030      1040
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 2193 AFIQHPSLMFEQEVKNLYNSILSDknSSVNLKIQVLKNLQTYLQEEDTRMQQADRDWKKVAKQ-----EDLKEMGDVSSG 2267
Cdd:cd23958  899 LCIAHPKLFLSPEVLKLLDEILAS--GSLKLKLQVLRNLQEFLQAEEKRMEAADAEWKKNSKAadvkvLDGKEMGDADSG 976
                       1050      1060      1070      1080      1090      1100      1110      1120
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 2268 MSSSIMQLYLKQVLEAFFHTQSSVRHFALNVIALTLNQGLIHPVQCVPYLIAMGTDPEPAMRNKADQQLVEIDKKYAGFI 2347
Cdd:cd23958  977 VASSIMQRYLKDILELCLSSDSQVRLAALKVLELILRQGLVHPIQCVPTLIALETDPNPAIRKLALRLLKELHEKYESLV 1056
                       1130      1140      1150      1160      1170      1180      1190      1200
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 2348 HMKAVAGMKMSYQVQQAINTclKDPVRGFRQDESSSALCSHLYSMIRGNRQHRRAFLISLLNLFDD------TAKTEVTM 2421
Cdd:cd23958 1057 ESKYLEGVRLAFQYQKRLAG--DTRGRGFRTDSPPTALLGRLYSLLRGNRKSRRKFLKSLLKLFDFdlkkssDSPSDLDF 1134
                       1210      1220      1230      1240
                 ....*....|....*....|....*....|....*....|....*..
gi 49169845 2422 LLYIADNLACFPYQTQEEPLFIMHHIDITLSVSGSNLLQSF-KESMV 2467
Cdd:cd23958 1135 LLFLAENLAFLPYQTQDEPLFVIHTIDRILSVTGSSLLQAIaKASQA 1181
Nipped-B_C pfam12830
Sister chromatid cohesion C-terminus; This domain lies towards the C-terminus of nipped-B or ...
2269-2450 1.20e-69

Sister chromatid cohesion C-terminus; This domain lies towards the C-terminus of nipped-B or sister chromatid cohesion proteins.


Pssm-ID: 463722  Cd Length: 180  Bit Score: 232.43  E-value: 1.20e-69
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   2269 SSSIMQLYLKQVLEAFFHTQSSVRHFALNVIALTLNQGLIHPVQCVPYLIAMGTDPEPAMRNKADQQLVEIDKKYAGFIH 2348
Cdd:pfam12830    1 CSALVQRYLKHILEICLSSDDQVRLLALEVLALILRQGLVHPKECIPTLIALETSPNPYIRKLAFELHKELHEKHESLLE 80
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   2349 MKAVAGMKMSYQVQQAINTClkdpvrgfRQDESSSALCSHLYSMIRGNRQHRRAFLISLLNLFDD------TAKTEVTML 2422
Cdd:pfam12830   81 SRYMEGIRLAFEYQRRVLSG--------ATLEPPTSFLSLLYSLLRSNKKSRKKFLKSLVKLFFDldlsseSSPSDLDFL 152
                          170       180
                   ....*....|....*....|....*...
gi 49169845   2423 LYIADNLACFPYQTQEEPLFIMHHIDIT 2450
Cdd:pfam12830  153 RFLAENLAFLPYQTQDEVLFLIHHIDRI 180
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
597-830 6.50e-15

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 80.58  E-value: 6.50e-15
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   597 PETPKKKSDPELSKSEMKQNESRLSESKPNENQLgESKSNESKLETKTETPTEELKQNENKTTESKQSESAVVEPKQNEN 676
Cdd:NF033839  281 QDTPKEPGNKKPSAPKPGMQPSPQPEKKEVKPEP-ETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPE 359
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   677 RPcdtKPNDNKQNNTRSENTKARPETPKQKAESRPETPKQKSEGRPETPKQKGDGRPETPKQKSEGRPETPKQKGEGRPE 756
Cdd:NF033839  360 KP---KPEVKPQPEKPKPEVKPQPETPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPEKPKPEVKPQPE 436
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   757 TPK---------HRHENRRDSGKPSTEKKPDVSKHKQDIKsdsPRLKSERAEALKQRPDGRWESLRRD-HDSKQKSDDRG 826
Cdd:NF033839  437 KPKpevkpqpekPKPEVKPQPETPKPEVKPQPEKPKPEVK---PQPEKPKPDNSKPQADDKKPSTPNNlSKDKQPSNQAS 513

                  ....
gi 49169845   827 ESER 830
Cdd:NF033839  514 TNEK 517
PspC_subgroup_2 NF033839
pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, ...
551-933 1.09e-14

pneumococcal surface protein PspC, LPXTG-anchored form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site. The other form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A.


Pssm-ID: 468202 [Multi-domain]  Cd Length: 557  Bit Score: 80.20  E-value: 1.09e-14
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   551 VDSQASITQDSDSIKKPEETKQCNDAPISVLQEDIVGSLKSIPENHPETPKKKSDPELSKSEMKQNESRLSESKPNENQL 630
Cdd:NF033839  151 SSSGSSTKPETPQPENPEHQKPTTPAPDTKPSPQPEGKKPSVPDINQEKEKAKLAVATYMSKILDDIQKHHLQKEKHRQI 230
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   631 GESKSNESKLETKTETPTEELKQN---ENKTTESKQSESAVVEPKQNENRPCDTKPNDNKQNNTRSENTKARPETPKQKA 707
Cdd:NF033839  231 VALIKELDELKKQALSEIDNVNTKveiENTVHKIFADMDAVVTKFKKGLTQDTPKEPGNKKPSAPKPGMQPSPQPEKKEV 310
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   708 ESRPETPKQKSEGRPETPKQKGDGRPETPKQKSEGRPETPKQKGEGRPETPK---------HRHENRRDSGKPSTEKKPD 778
Cdd:NF033839  311 KPEPETPKPEVKPQLEKPKPEVKPQPEKPKPEVKPQLETPKPEVKPQPEKPKpevkpqpekPKPEVKPQPETPKPEVKPQ 390
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   779 VSKHKQDIKSDSPRLKSEraeaLKQRPDGRWESLRRDHDsKQKSDDRGESERHRGDQSrvRRPETLRSSSRNDhstksdg 858
Cdd:NF033839  391 PEKPKPEVKPQPEKPKPE----VKPQPEKPKPEVKPQPE-KPKPEVKPQPEKPKPEVK--PQPEKPKPEVKPQ------- 456
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 49169845   859 SKTEKLERKHRHESgdsrdrPSGEQKSRPDSPRvkqGDTNKSRPGFKSPNSKDdkrtEGNRSKVDSNKAHTDNKA 933
Cdd:NF033839  457 PETPKPEVKPQPEK------PKPEVKPQPEKPK---PDNSKPQADDKKPSTPN----NLSKDKQPSNQASTNEKA 518
PTZ00121 PTZ00121
MAEBL; Provisional
476-1099 6.34e-10

MAEBL; Provisional


Pssm-ID: 173412 [Multi-domain]  Cd Length: 2084  Bit Score: 65.55  E-value: 6.34e-10
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   476 EIERIERESAIERERFSKEVQDKDKPLKKRKQDSYPQEAggatggnrPASQETGSTGNGSRPALMVSIDLHQAGRVdSQA 555
Cdd:PTZ00121 1216 EARKAEDAKKAEAVKKAEEAKKDAEEAKKAEEERNNEEI--------RKFEEARMAHFARRQAAIKAEEARKADEL-KKA 1286
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   556 SITQDSDSIKKPEETKQCNDAPISVLQEDIVGSLKSIPENH---PETPKKKSDP-----ELSKSEMKQNESRLSESKpNE 627
Cdd:PTZ00121 1287 EEKKKADEAKKAEEKKKADEAKKKAEEAKKADEAKKKAEEAkkkADAAKKKAEEakkaaEAAKAEAEAAADEAEAAE-EK 1365
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   628 NQLGESKSNESKletkteTPTEELKQnenKTTESKQSESAVVEPKQNENRPCDTKpndnkqnntRSENTKARPETPKQKA 707
Cdd:PTZ00121 1366 AEAAEKKKEEAK------KKADAAKK---KAEEKKKADEAKKKAEEDKKKADELK---------KAAAAKKKADEAKKKA 1427
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   708 ES--RPETPKQKSEgrpetPKQKGDgrpETPKQKSEGR-PETPKQKGE--GRPETPKHRHENRR--DSGKPSTE---KKP 777
Cdd:PTZ00121 1428 EEkkKADEAKKKAE-----EAKKAD---EAKKKAEEAKkAEEAKKKAEeaKKADEAKKKAEEAKkaDEAKKKAEeakKKA 1499
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   778 DVSKHKQDIKSDSPRLK----SERAEALKQRPDGRWESLRRDHDSKQKSDDRGESERHRGDQSRVRRPETLRSSSRNDHS 853
Cdd:PTZ00121 1500 DEAKKAAEAKKKADEAKkaeeAKKADEAKKAEEAKKADEAKKAEEKKKADELKKAEELKKAEEKKKAEEAKKAEEDKNMA 1579
                         410       420       430       440       450       460       470       480
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   854 TK--SDGSKTEKLERKHRHESGDSRDRPSGEQKSRPDSPRVKQGDTNKSRPGFKSPNSKDDKRTEGNRSKVDSNKAHTDN 931
Cdd:PTZ00121 1580 LRkaEEAKKAEEARIEEVMKLYEEEKKMKAEEAKKAEEAKIKAEELKKAEEEKKKVEQLKKKEAEEKKKAEELKKAEEEN 1659
                         490       500       510       520       530       540       550       560
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   932 KAefpsyllggRSGALKNFVIPKIKRDKDGNITQETKK-----MDMKGEQKDKVEKM-GLVEDLNKGAKPVVVLQKLSLD 1005
Cdd:PTZ00121 1660 KI---------KAAEEAKKAEEDKKKAEEAKKAEEDEKkaaeaLKKEAEEAKKAEELkKKEAEEKKKAEELKKAEEENKI 1730
                         570       580       590       600       610       620       630       640
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845  1006 DVQKLIKDREEKSRsslKSIKNKPSKSNKGSIDQSVLKELPPELLAEIESTMPLCERVKMNKRKRSTVNEKPKYAEISSD 1085
Cdd:PTZ00121 1731 KAEEAKKEAEEDKK---KAEEAKKDEEEKKKIAHLKKEEEKKAEEIRKEKEAVIEEELDEEDEKRRMEVDKKIKDIFDNF 1807
                         650       660
                  ....*....|....*....|....*.
gi 49169845  1086 ED------------NDSDEAFESSRK 1099
Cdd:PTZ00121 1808 ANiieggkegnlviNDSKEMEDSAIK 1833
Cohesin_HEAT pfam12765
HEAT repeat associated with sister chromatid cohesion; This HEAT repeat is found most ...
1788-1829 2.96e-09

HEAT repeat associated with sister chromatid cohesion; This HEAT repeat is found most frequently in sister chromatid cohesion proteins such as Nipped-B. HEAT repeats are found tandemly repeated in many proteins, and they appear to serve as flexible scaffolding on which other components can assemble.


Pssm-ID: 403845 [Multi-domain]  Cd Length: 42  Bit Score: 54.39  E-value: 2.96e-09
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|..
gi 49169845   1788 KCLSEVVAVDPSILARLDMQRGVHGRLMDNSTSVREAAVELL 1829
Cdd:pfam12765    1 KALSSLVEKDPSILDSPDVKEAISRRLTDSSPSVRDAALELL 42
PRK12678 PRK12678
transcription termination factor Rho; Provisional
693-919 1.55e-07

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 57.22  E-value: 1.55e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   693 SENTKARPETPKQKAESRPETPKQKSEGRPETPKQKGDGRPETPKQKSEGRPETPKQKGEGRPETPKHRHENRRDSGKPS 772
Cdd:PRK12678   67 AATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGEAARRGAARKAG 146
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   773 TEKKPDVSKHKQDIKSDSPRLKSERAEALKQRPDgrweslRRDHDSKQKSDDRGESERHRGDQSRVRRpetlrsssRNDH 852
Cdd:PRK12678  147 EGGEQPATEARADAAERTEEEERDERRRRGDRED------RQAEAERGERGRREERGRDGDDRDRRDR--------REQG 212
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 49169845   853 STKSDGSKTEKLERKHRHESGDSRDRPSGEQKSRPDSPRVKQGDTNKSRPGFKSpNSKDDKRTEGNR 919
Cdd:PRK12678  213 DRREERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRF-RDRDRRGRRGGD 278
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
476-909 4.69e-07

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 55.85  E-value: 4.69e-07
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   476 EIERIERESAIERERFSKEVQDK-DKPLKKRKQDSYPQEAGG---ATGGNRPASQETGSTGNGSRPALmvsidlhqagrv 551
Cdd:PTZ00449  484 EIKKLIKKSKKKLAPIEEEDSDKhDEPPEGPEASGLPPKAPGdkeGEEGEHEDSKESDEPKEGGKPGE------------ 551
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   552 dsqasiTQDSDSIKKPEETKQCNDAPISVLQEDIVGSLKSIPENHPETPKKKSDPELSKSEMKQNESRLSESKPNENQLG 631
Cdd:PTZ00449  552 ------TKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDPKHPKDPEEPKKPKRPRSAQRPTRPKSPKLPELLDIPKSPK 625
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   632 ESKSNESKLETKTETPTEELKQNEN----KTTESKQSESAVVEPKQNEN----------RPCDTKPNDNKQNNTRSENTK 697
Cdd:PTZ00449  626 RPESPKSPKRPPPPQRPSSPERPEGpkiiKSPKPPKSPKPPFDPKFKEKfyddyldaaaKSKETKTTVVLDESFESILKE 705
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   698 ARPETPKQKAES-------RPETPKQKSE--GRPETPKQKGDGRPETPKQKSEGRPETPKQKGEGRPETPKHRHEN-RRD 767
Cdd:PTZ00449  706 TLPETPGTPFTTprplppkLPRDEEFPFEpiGDPDAEQPDDIEFFTPPEEERTFFHETPADTPLPDILAEEFKEEDiHAE 785
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   768 SGKP-STEKKPDV-SKHKQDIKSDSPRL--KSERAEALK------QRPDGRwesLRRDHDSKQKSDDRGESerhRGDQSR 837
Cdd:PTZ00449  786 TGEPdEAMKRPDSpSEHEDKPPGDHPSLpkKRHRLDGLAlsttdlESDAGR---IAKDASGKIVKLKRSKS---FDDLTT 859
                         410       420       430       440       450       460       470
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 49169845   838 VRRPETLRSSSR----NDHSTKSDGSKTEKLERKHRHESgdSRDRPSgEQKSRPDSPrvkqgdTNKSRPgfKSPNS 909
Cdd:PTZ00449  860 VEEAEEMGAEARkivvDDDGTEADDEDTHPPEEKHKSEV--RRRRPP-KKPSKPKKP------SKPKKP--KKPDS 924
ftsN TIGR02223
cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a ...
592-777 6.34e-07

cell division protein FtsN; FtsN is a poorly conserved protein active in cell division in a number of Proteobacteria. The N-terminal 30 residue region tends to by Lys/Arg-rich, and is followed by a membrane-spanning region. This is followed by an acidic low-complexity region of variable length and a well-conserved C-terminal domain of two tandem regions matched by pfam05036 (Sporulation related repeat), found in several cell division and sporulation proteins. The role of FtsN as a suppressor for other cell division mutations is poorly understood; it may involve cell wall hydrolysis. [Cellular processes, Cell division]


Pssm-ID: 274041 [Multi-domain]  Cd Length: 298  Bit Score: 53.93  E-value: 6.34e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845    592 IPENHPEtpkkKSDPELSKSEMKQNESRLSESKPN--ENQLGESKSNESKLETKTETPTEELKQNENKTTESKQSESAVV 669
Cdd:TIGR02223   47 LLTESKQ----ANEPETLQPKNQTENGETAADLPPkpEERWSYIEELEAREVLINDPEEPSNGGGVEESAQLTAEQRQLL 122
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845    670 EPKQNENRPCDtkpndnKQNNTRSENTKARPETPKQKAESRPETPKQKSEGRPETPKQKGDGRPET--PKQKSEGRPETP 747
Cdd:TIGR02223  123 EQMQADMRAAE------KVLATAPSEQTVAVEARKQTAEKKPQKARTAEAQKTPVETEKIASKVKEakQKQKALPKQTAE 196
                          170       180       190
                   ....*....|....*....|....*....|
gi 49169845    748 KQKGEGRPETPkhRHENRRDSGKPSTEKKP 777
Cdd:TIGR02223  197 TQSNSKPIETA--PKADKADKTKPKPKEKA 224
PRK12678 PRK12678
transcription termination factor Rho; Provisional
660-896 1.45e-06

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 53.75  E-value: 1.45e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   660 ESKQSESAVVEPKQNENRPCDTKPNDNKQNNTRSENTKARPETPKQKAESRPETPKQKSEGRPETPKQKGDGRPETPKQK 739
Cdd:PRK12678   56 KEARGGGAAAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGE 135
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   740 SEGRPETPKQKGEGRPETPKHRHENRRDSGKPSTEKKPDVSKHK-----QDIKSDSPRLKSERAEALKQRPDGRWESLRR 814
Cdd:PRK12678  136 AARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREdrqaeAERGERGRREERGRDGDDRDRRDRREQGDRR 215
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   815 DHDSKQKSDDRGESERHRGDQSRVRRPETLRSSSRNDHSTKSDGSKTEKLERKHRHESGDSRDRPSGeqksrpDSPRVKQ 894
Cdd:PRK12678  216 EERGRRDGGDRRGRRRRRDRRDARGDDNREDRGDRDGDDGEGRGGRRGRRFRDRDRRGRRGGDGGNE------REPELRE 289

                  ..
gi 49169845   895 GD 896
Cdd:PRK12678  290 DD 291
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
862-1107 1.88e-05

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 50.43  E-value: 1.88e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   862 EKLERKHRhesgdsrdRPSGEQKSRPDSPRVKQGDTNKSRPGFKSPNSKDDKRTEGNRSKVDSNKAHTDNKAefPSYLLG 941
Cdd:PTZ00108 1149 EKEIAKEQ--------RLKSKTKGKASKLRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLDDK--PDNKKS 1218
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   942 GRSGALKNFVIPKIKRDKDGNITQETKKMDmKGEQKDKVEKMGLVEDLNKGAKPVVVLQKLSLDDVQKLIKDREEKSRSS 1021
Cdd:PTZ00108 1219 NSSGSDQEDDEEQKTKPKKSSVKRLKSKKN-NSSKSSEDNDEFSSDDLSKEGKPKNAPKRVSAVQYSPPPPSKRPDGESN 1297
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845  1022 LKSIKNKPSKSNKGSIDQSVLKELPPELLAEIESTmplcERVKMNKR-------KRSTVNEKPKYAEISSDEDNDSDEAF 1094
Cdd:PTZ00108 1298 GGSKPSSPTKKKVKKRLEGSLAALKKKKKSEKKTA----RKKKSKTRvkqasasQSSRLLRRPRKKKSDSSSEDDDDSEV 1373
                         250
                  ....*....|...
gi 49169845  1095 ESSRKRHKKDDDK 1107
Cdd:PTZ00108 1374 DDSEDEDDEDDED 1386
PDS5 cd19953
Sister chromatid cohesion protein PDS5; Pds5 plays a crucial role in sister chromatid cohesion. ...
1780-1876 4.81e-04

Sister chromatid cohesion protein PDS5; Pds5 plays a crucial role in sister chromatid cohesion. Together with WapI and Scc3, it is involved in the release of the cohesin complex from chromosomes during S phase. The core of the cohesin complex consists of a coiled-coiled heterodimer of Smc1 and Smc30, together with Scc1 (also called kleisin). Pds5 interacts with Scc1 via a conserved patch on the surface of its heat repeats. Pds5 also promotes the acetylation of Smc3 that protects cohesin from releasing activity in G2 phase.


Pssm-ID: 410996 [Multi-domain]  Cd Length: 630  Bit Score: 45.59  E-value: 4.81e-04
                         10        20        30        40        50        60        70        80
                 ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845 1780 IAVRTKAMKCLSEVVAVDPS-ILARldmqrgVH--------GRLMDNSTSVREAAVELLGRFVLCRPQLAEQYYDMLIER 1850
Cdd:cd19953  259 VDVRLLATKLLGKMFAEKGSaGFAQ------TYpslwkeflGRFNDKSPEVRLAWVESAKHILLNHPDLAEDILEALKKR 332
                         90       100
                 ....*....|....*....|....*.
gi 49169845 1851 ILDTGISVRKRVIKILRDICIEQPTF 1876
Cdd:cd19953  333 LLDPDEKVRLAAVKAICDLAYEDLLH 358
PDS5 pfam20168
Sister chromatid cohesion protein PDS5 protein; This entry represents the Sister chromatid ...
1779-1922 5.14e-04

Sister chromatid cohesion protein PDS5 protein; This entry represents the Sister chromatid cohesion protein PDS5. The large PDS5 molecule is exclusively alpha helical, composed of a large number of HEAT-like repeats and helical extensions/additions that deviate from the HEAT repeat pattern.


Pssm-ID: 466319 [Multi-domain]  Cd Length: 1051  Bit Score: 45.66  E-value: 5.14e-04
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   1779 AIAVRTKAMKCLSEVVAVDPSIlaRLDMQRGVHGRLMDNSTSVREAAVELLGRF-------VLCRPQLAEqyydmLIERI 1851
Cdd:pfam20168  297 SVAVRIAWVEAAKQILLNHPDL--RSEILEALKDRLLDPDEKVRLAAVKAIGDLdyetllhVVSEKLLKT-----LAERL 369
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   1852 LDTGISVRKRVIKILRDI-------CIEQPtfPKITEMCV----KMIR--RVNDEEgIKKLVNETFQKLWFtPTPHNDKE 1918
Cdd:pfam20168  370 RDKKPSVRKEALKTLAKLynvaygeIEEGD--EEAIEKFGwipnKILHlyYINDPE-IRALVERVLFEYLL-PALLDDEE 445

                   ....
gi 49169845   1919 AMTR 1922
Cdd:pfam20168  446 RVKR 449
dnaA PRK14086
chromosomal replication initiator protein DnaA;
700-839 6.22e-04

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 45.20  E-value: 6.22e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   700 PETPKQKAESRPETPKQKSEGRPETPKQKGDGRPETPKQKS---EGRPETPKQKGEGR----PETPKHRHENRRDSGKPS 772
Cdd:PRK14086   97 PPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPRQDqlpTARPAYPAYQQRPEpgawPRAADDYGWQQQRLGFPP 176
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 49169845   773 TEKKPDVSKHKQDIKSDSPRLKSERAEALKQRPDGRWEslRRDHDsKQKSDDRGESERHRGDQSRVR 839
Cdd:PRK14086  177 RAPYASPASYAPEQERDREPYDAGRPEYDQRRRDYDHP--RPDWD-RPRRDRTDRPEPPPGAGHVHR 240
dnaA PRK14086
chromosomal replication initiator protein DnaA;
708-903 1.42e-03

chromosomal replication initiator protein DnaA;


Pssm-ID: 237605 [Multi-domain]  Cd Length: 617  Bit Score: 44.05  E-value: 1.42e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   708 ESRPETPKQKSEGRPETPKQKGDGRPETPKQKSEGRPETPKQkGEGRPeTPKHRHENRRDSGKPSTEKKPDVSKHKQDIK 787
Cdd:PRK14086   94 EPAPPPPHARRTSEPELPRPGRRPYEGYGGPRADDRPPGLPR-QDQLP-TARPAYPAYQQRPEPGAWPRAADDYGWQQQR 171
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   788 SDSPRLKSERAEALKQRPDGRWESlrrdhdskQKSDDRGESERHRGDQSRVRRpetlrsssrndHSTKSDGSKTEKLERK 867
Cdd:PRK14086  172 LGFPPRAPYASPASYAPEQERDRE--------PYDAGRPEYDQRRRDYDHPRP-----------DWDRPRRDRTDRPEPP 232
                         170       180       190       200
                  ....*....|....*....|....*....|....*....|....
gi 49169845   868 ----HRHESGD-SRDRPSGEQKSRPDS---PRVKQGDTNKSRPG 903
Cdd:PRK14086  233 pgagHVHRGGPgPPERDDAPVVPIRPSapgPLAAQPAPAPGPGE 276
SF-CC1 TIGR01622
splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors ...
802-919 1.61e-03

splicing factor, CC1-like family; This model represents a subfamily of RNA splicing factors including the Pad-1 protein (N. crassa), CAPER (M. musculus) and CC1.3 (H.sapiens). These proteins are characterized by an N-terminal arginine-rich, low complexity domain followed by three (or in the case of 4 H. sapiens paralogs, two) RNA recognition domains (rrm: pfam00706). These splicing factors are closely related to the U2AF splicing factor family (TIGR01642). A homologous gene from Plasmodium falciparum was identified in the course of the analysis of that genome at TIGR and was included in the seed.


Pssm-ID: 273721 [Multi-domain]  Cd Length: 494  Bit Score: 43.75  E-value: 1.61e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845    802 KQRPDGRWESLRRDHDSKQKSD-DRgesERHRgDQSRVRRpetlRSSSRnDHSTKSDGSKTEKLERKHRHESGDSRDRPS 880
Cdd:TIGR01622    3 RDRERERLRDSSSAGDRDRRRDkGR---ERSR-DRSRDRE----RSRSR-RRDRHRDRDYYRGRERRSRSRRPNRRYRPR 73
                           90       100       110
                   ....*....|....*....|....*....|....*....
gi 49169845    881 GEQKSRPDSPRVKQGDTNKSRPGFKSPNSKDDKRTEGNR 919
Cdd:TIGR01622   74 EKRRRRGDSYRRRRDDRRSRREKPRARDGTPEPLTEDER 112
PspC_subgroup_1 NF033838
pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, ...
556-930 1.94e-03

pneumococcal surface protein PspC, choline-binding form; The pneumococcal surface protein PspC, as described in Streptococcus pneumoniae, is a repetitive and highly variable protein, recognized by a conserved N-terminal domain and also by genomic location. This form, subgroup 1, has variable numbers of a choline-binding repeat in the C-terminal region, and is also known as choline-binding protein A. The other form, subgroup 2, is anchored covalently after cleavage by sortase at a C-terminal LPXTG site.


Pssm-ID: 468201 [Multi-domain]  Cd Length: 684  Bit Score: 43.85  E-value: 1.94e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   556 SITQDSDSIKKPEETKQCNDAPISVLQEDIVGSLKSIPENHPETPKKKSDPELSKSEMKQN---------ESRLSES--- 623
Cdd:NF033838  105 NVLKEKSEAELTSKTKKELDAAFEQFKKDTLEPGKKVAEATKKVEEAEKKAKDQKEEDRRNyptntyktlELEIAESdve 184
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   624 -KPNENQLGESKSNESKletktetPTEELKQNENKTtESKQSESAVVE----PKQNENRPCDTKPNDNKQNNTRSENTKA 698
Cdd:NF033838  185 vKKAELELVKEEAKEPR-------DEEKIKQAKAKV-ESKKAEATRLEkiktDREKAEEEAKRRADAKLKEAVEKNVATS 256
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   699 RPETPKQKAE----SRPETPKQKSEGRPETPKQKGDGRPETPKQKSEGR-PETPKQKGEGRPETPKHRHENRRDSgKPST 773
Cdd:NF033838  257 EQDKPKRRAKrgvlGEPATPDKKENDAKSSDSSVGEETLPSPSLKPEKKvAEAEKKVEEAKKKAKDQKEEDRRNY-PTNT 335
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   774 EKKPDVSKHKQDIKSDSPRLKSERAEALKQRPDGRWESLRRDHDSKQKSDDRGEserhrgdqsrvrrpetlrsssrndhS 853
Cdd:NF033838  336 YKTLELEIAESDVKVKEAELELVKEEAKEPRNEEKIKQAKAKVESKKAEATRLE-------------------------K 390
                         330       340       350       360       370       380       390
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*..
gi 49169845   854 TKSDGSKTEKLERKHRHESGDSRDRPSgEQKSRPDSPRVKqgdtnksRPGFKSPNSKDDKRTEgnrsKVDSNKAHTD 930
Cdd:NF033838  391 IKTDRKKAEEEAKRKAAEEDKVKEKPA-EQPQPAPAPQPE-------KPAPKPEKPAEQPKAE----KPADQQAEED 455
PTZ00112 PTZ00112
origin recognition complex 1 protein; Provisional
594-935 2.12e-03

origin recognition complex 1 protein; Provisional


Pssm-ID: 240274 [Multi-domain]  Cd Length: 1164  Bit Score: 43.83  E-value: 2.12e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   594 ENHPETPKKKsdpelsksEMKQNESRLSESKPNENQLGE-----SKSNESKLETKTETPTEELKQNENKTTESKQSESAV 668
Cdd:PTZ00112   59 LSFENTPRKE--------EKKKKNLNLPDYNQIQNNTHDfyidlNERSKTPIKNNDNVTTPIKANKKEKHNLDSSSSSSI 130
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   669 VEPKQNENrpCDTKPNDNKQNNTRSENTKARPETPKQKAE-----SRPETPKQKSEGRPETPKQKG-------DGRPETP 736
Cdd:PTZ00112  131 SSSLTNIS--FFSSPTSIYSCLSNSLSSKHSPKVIKENQSthvniSSDNSPRNKEISNKQLKKQTNvthttcyDKMRRSP 208
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   737 KQKSEGRPETPKQKGEGRPETPKHRHENRRDS--GKPSTEKKPDVSKH---------KQDIKSDSPRLKS-ERAEALKQR 804
Cdd:PTZ00112  209 RNTSTIKNNTNDKNKEKNKEKDKNIKKDRDGDkqTKRNSEKSKVQNSHfdvrilrsyTKENKKDEKNVVSgIRSSVLLKR 288
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   805 PDgrwESLRRDHDSKQKSDDRgeseRHRGDQSRVRRPETLRSSSRNDHSTKSDGSKTeklerkhrhesgdsrDRPSGEQk 884
Cdd:PTZ00112  289 KS---QCLRKDSYVYSNHQKK----AKTGDPKNIIHRNNGSSNSNNDDTSSSNHLGS---------------NRISNRN- 345
                         330       340       350       360       370
                  ....*....|....*....|....*....|....*....|....*....|.
gi 49169845   885 srPDSPRVKQGDTNKSRPgfkSPNSKDDKRTEGNRSKVDSNKAHTDNKAEF 935
Cdd:PTZ00112  346 --PSSPYKKQTTTKHTNN---TKNNKYNKTKTTQKFNHPLRHHATINKRSS 391
PRK08581 PRK08581
amidase domain-containing protein;
542-764 2.53e-03

amidase domain-containing protein;


Pssm-ID: 236304 [Multi-domain]  Cd Length: 619  Bit Score: 43.24  E-value: 2.53e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   542 SIDLHQAGRVDSQASITQDSDSIKKPEETKQCNDAPISVLQEDIVGSLKSIPENHPETPKKKSDPELSKSEMKQNESRLS 621
Cdd:PRK08581   52 SKDTSSKDTDKADNNNTSNQDNNDKKFSTIDSSTSDSNNIIDFIYKNLPQTNINQLLTKNKYDDNYSLTTLIQNLFNLNS 131
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   622 ESKPNENQLGESKSNESKLETKTETPTEELKQN---ENKTTESKQSESAVVEPKQNENRPCDTKPNDNKQNNTRSE--NT 696
Cdd:PRK08581  132 DISDYEQPRNSEKSTNDSNKNSDSSIKNDTDTQsskQDKADNQKAPSSNNTKPSTSNKQPNSPKPTQPNQSNSQPAsdDT 211
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 49169845   697 KARPETPKQKAESRPET----PKQKSEGRPETPKQ-KGDGRPETPKQKSEGRPETPKQKGEGRPETPKHRHEN 764
Cdd:PRK08581  212 ANQKSSSKDNQSMSDSAldsiLDQYSEDAKKTQKDyASQSKKDKTETSNTKNPQLPTQDELKHKSKPAQSFEN 284
PTZ00108 PTZ00108
DNA topoisomerase 2-like protein; Provisional
708-938 8.53e-03

DNA topoisomerase 2-like protein; Provisional


Pssm-ID: 240271 [Multi-domain]  Cd Length: 1388  Bit Score: 41.96  E-value: 8.53e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   708 ESRPETPKQKSEGRPETPKQKGDGRPeTPKQKSEGRPETPKQKGEGRPETPKHRHENRRDSGKPSTEKKpDVSKHKQDIK 787
Cdd:PTZ00108 1143 EQEEVEEKEIAKEQRLKSKTKGKASK-LRKPKLKKKEKKKKKSSADKSKKASVVGNSKRVDSDEKRKLD-DKPDNKKSNS 1220
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   788 SDSPRLKSERAEALKQRPDGRWESLRRDHDSKQKSDDRGESERHRgDQSRVRRPETLRSSSRNDHSTKSdgSKTEKLERK 867
Cdd:PTZ00108 1221 SGSDQEDDEEQKTKPKKSSVKRLKSKKNNSSKSSEDNDEFSSDDL-SKEGKPKNAPKRVSAVQYSPPPP--SKRPDGESN 1297
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*...
gi 49169845   868 HRHESGDS-------RDRPSGEQKSRPDSPRVKQGDTNKSRPGFKSPNSKDDKRTEGNRSKVDSNKAHTDNKAEFPSY 938
Cdd:PTZ00108 1298 GGSKPSSPtkkkvkkRLEGSLAALKKKKKSEKKTARKKKSKTRVKQASASQSSRLLRRPRKKKSDSSSEDDDDSEVDD 1375
PRK12678 PRK12678
transcription termination factor Rho; Provisional
763-934 9.86e-03

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 41.43  E-value: 9.86e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   763 ENRRDSGKPSTEKKPDVSKHKQDIKSDSPRLKSERAEALKQRPDGRWESLRRDHDSKQKSDDRGESERHRGDQSRVRRPE 842
Cdd:PRK12678   56 KEARGGGAAAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRERGE 135
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845   843 TLRSSSRNDHSTKSDGSKTEKLERKHRHESGDSRDRPSGEQKSRPDSPRVKQGDTNKSRpgfksPNSKDDKRTEGNRSKV 922
Cdd:PRK12678  136 AARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQAEAERGERGRRE-----ERGRDGDDRDRRDRRE 210
                         170
                  ....*....|..
gi 49169845   923 DSNKAHTDNKAE 934
Cdd:PRK12678  211 QGDRREERGRRD 222
IHABP4_N pfam16174
Intracellular hyaluronan-binding protein 4 N-terminal; IHABP4_N is the N-terminal region of ...
651-755 9.96e-03

Intracellular hyaluronan-binding protein 4 N-terminal; IHABP4_N is the N-terminal region of intracellular hyaluronan-binding protein 4-like and SERPINE1 mRNA binding protein 1-like proteins. This region carries nuclear localization sites, and may also be involved in the binding to some of the partners in the translational machinery.


Pssm-ID: 465041 [Multi-domain]  Cd Length: 140  Bit Score: 39.01  E-value: 9.96e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 49169845    651 LKQNENKTTESKQSESAVVEPKQNENRPCDTKPNDNKqNNTRSENTKARPETPKQKAESRPETPKQKSEGRPETPKqkgd 730
Cdd:pfam16174   27 LREAENKKKLQLKIKFGEKANKGKGAGPRGGPRGGQA-RIGRKESQKDRKNDVASGQPDKPGGGGNDEPGERRPPR---- 101
                           90       100
                   ....*....|....*....|....*.
gi 49169845    731 gRPETPKQKsEGRPETPKQ-KGEGRP 755
Cdd:pfam16174  102 -RGEQQGQG-EGRRTPPRReRGERRP 125
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH