NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|568926927|ref|XP_006538098|]
View 

collagen alpha-1(XXVII) chain isoform X1 [Mus musculus]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
COLFI super family cl02436
Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1647-1845 1.71e-55

Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


The actual alignment was detected with superfamily member pfam01410:

Pssm-ID: 470578  Cd Length: 233  Bit Score: 193.33  E-value: 1.71e-55
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  1647 EIFKTLHYLSNLIQSIKTPLGTKENPARVCRDLMDCEQRMADGTYWVDPNLGCSSDTIEVSCNFTQGgQTCLKPITAS-- 1724
Cdd:pfam01410    4 EVMATLKSLSQQIENIRSPDGSKKNPARTCRDLKLCHPDWKSGEYWIDPNQGCTRDAIKVFCNFETG-ETCIYPTKASip 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  1725 --------------------KAEF---------AVSRVQMNFLHLLSSEGTQHITIHCLNMTVWQEGPGrSSARQAVRFR 1775
Cdd:pfam01410   83 rknwwtkeskhvwfgefmngGSQFsygvdgvgpSVAAVQLTFLRLLSTEASQNITYHCKNSVAYMDQAT-GNLKKALLLQ 161
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1776 AWNGQVFEAGG--QFRPEVSMDGCKVHDGRWHQTLFTFRTQDPQQLPIVsvdNLPPVSSGK---QYRLEVGPACF 1845
Cdd:pfam01410  162 GSNDEEIRAEGnsRFTYTVLEDGCTKRTGQWGKTVIEYRTQKVSRLPIV---DIAPMDIGGadqEFGVEVGPVCF 233
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1354-1584 5.44e-35

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 140.43  E-value: 5.44e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1354 EGVQGLRGEpGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPegtagsdgipg 1433
Cdd:NF038329  108 EGLQQLKGD-GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGP----------- 175
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1434 rdgrPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGPPGKRGtEGGTGLPGNQGEPGSKGQP 1513
Cdd:NF038329  176 ----AGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQ 250
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568926927 1514 GDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGPPGLMGKEGIIGPPGMLGPSGLPGPKGDRGSRGDLGL 1584
Cdd:NF038329  251 GPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQ 321
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1168-1455 1.14e-34

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 139.27  E-value: 1.14e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1168 GQRGEPGLEgdhGPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGAKGH 1247
Cdd:NF038329  117 GEKGEPGPA---GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGP 193
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1248 QGylgemgipgepgppgtpgpkgSRGTLGPTGAPGRMGAQGEPGLAGYNGhkgitgplgppgpkgEKGDQGEDGktegpp 1327
Cdd:NF038329  194 QG---------------------PRGETGPAGEQGPAGPAGPDGEAGPAG---------------EDGPAGPAG------ 231
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1328 gppgdrgpvgdRGDRGEPGDPGYPGQEGVQGLRGEPGQQGQpghPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGL 1407
Cdd:NF038329  232 -----------DGQQGPDGDPGPTGEDGPQGPDGPAGKDGP---RGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGL 297
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*...
gi 568926927 1408 QGLPGPRGVVGRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGP 1455
Cdd:NF038329  298 PGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPKTP 345
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
959-1234 1.54e-33

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 136.19  E-value: 1.54e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  959 LDGSKGEPGDPGRPGPVGEQGLMGFIGLVGEPGIVGEKGDRGvmgppgapgpkgsmghpgtpggignpgepgpwgPPGSR 1038
Cdd:NF038329  115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQG---------------------------------ERGEK 161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1039 GLPGMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSGAKGflgipgpsgppgAKGLPGEPGs 1118
Cdd:NF038329  162 GPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAG------------PAGEDGPAG- 228
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1119 qgpqgpVGPPGEMGPKGPPGAVGEPGLPGDSGMKGDLGPLGPPGEQGLIGQRGEPGLEGDHGPVGPDGLKGDRGDPGPDG 1198
Cdd:NF038329  229 ------PAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDG 302
                         250       260       270
                  ....*....|....*....|....*....|....*.
gi 568926927 1199 EHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQG 1234
Cdd:NF038329  303 KDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
728-974 1.90e-28

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 120.78  E-value: 1.90e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  728 FPGPVGDPGPKGSRGYIGLPGLFGLPGSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPPGVLGLI 807
Cdd:NF038329  115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQ 194
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  808 GDTGALGPVGYPGPKGMKGLMGGVGEPGLKGDKGEQGvpgvSGDPGFQGDKGSHGLPGLPGGRGKPGPLGKAGDKGslgf 887
Cdd:NF038329  195 GPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG----DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRG---- 266
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  888 pgppgPEGFPGDIGPPGDNGPEGMKGKPGARGLPGPPGQLGPEGDEgpmgppGVPGLEGQPGRKGFPGRPGLDGSKGEPG 967
Cdd:NF038329  267 -----EAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQN------GKDGLPGKDGKDGQPGKDGLPGKDGKDG 335

                  ....*..
gi 568926927  968 DPGRPGP 974
Cdd:NF038329  336 QPGKPAP 342
gly_rich_SclB super family cl45768
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
610-822 4.48e-22

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


The actual alignment was detected with superfamily member NF038329:

Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 101.52  E-value: 4.48e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  610 GPPGSKGDCGLPGPPGLPGLPGSPGARGPRGPPGPYGNPGPPGPPGAKGQKGDPGLSPGQAHDGAKGNMGLPGLSGNPGP 689
Cdd:NF038329  123 GPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGP 202
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  690 LGRKGHKGHPGAAGHPGEQGQPGPEGSPGA--------KGYPGRQGFPGPVGDPGPKGSRGYIGLPGLFGLPGSDGERGL 761
Cdd:NF038329  203 AGEQGPAGPAGPDGEAGPAGEDGPAGPAGDgqqgpdgdPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGP 282
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568926927  762 PGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPPGVLGLIGDTGALGPVGYPGPK 822
Cdd:NF038329  283 VGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPK 343
LamG super family cl22861
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
43-221 2.33e-13

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


The actual alignment was detected with superfamily member smart00210:

Pssm-ID: 473984  Cd Length: 184  Bit Score: 70.46  E-value: 2.33e-13
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927     43 DVDVLQRLGL-----SWTKAGGGRSPTPpgvipfpsGFIFTQRAKLQAPTANVLPTTLGRELALVLSLCSHRVNHAFLFA 117
Cdd:smart00210    1 GQDLLQVFDLpslsfAIRQVVGPEPGSP--------AYRLGDPALVPQPTRDLFPSGLPEDFSLLTTFRQTPKSRGVLFA 72
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927    118 IRSRKHKLQLGLQFLPGRTIIHL------GPRQSVAF-DLDVHDGRWHHLALELRGRTVTMVTACGQHRVpVPLPSRRDS 190
Cdd:smart00210   73 IYDAQNVRQFGLEVDGRANTLLLryqgvdGKQHTVSFrNLPLADGQWHKLALSVSGSSATLYVDCNEIDS-RPLDRPGQP 151
                           170       180       190
                    ....*....|....*....|....*....|.
gi 568926927    191 MLDPQGSFLLGKVNPRAVQFEGALCQFSIHP 221
Cdd:smart00210  152 PIDTDGIEVRGAQAADRKPFQGDLQQLKIVC 182
Herpes_BLLF1 super family cl37540
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
269-567 2.28e-07

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


The actual alignment was detected with superfamily member pfam05109:

Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 56.08  E-value: 2.28e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   269 GLGNLTRTPATLGArPVSRALAVTLAPAMPTKPLRTvhpdvsehSSSQTPLSPAKQSARKTPSPSSSASLANSTRVYRP- 347
Cdd:pfam05109  447 GLPSSTHVPTNLTA-PASTGPTVSTADVTSPTPAGT--------TSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPt 517
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   348 --AAAQPRQITTTSPTKRSPTKPSVSPLSVTPMKSPHATQKTgvPSFTKPVP----PT-QKPAPFTSYLAPS-KASSPTV 419
Cdd:pfam05109  518 pnATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPT--PAVTTPTPnatiPTlGKTSPTSAVTTPTpNATSPTV 595
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   420 rpvqktfmtprppvpspqplrPTTGLSKKFTNPTVAKSkSKTTSWASKPVLARSSVpKTLQQTVLSQSPVSY-LGSQTLA 498
Cdd:pfam05109  596 ---------------------GETSPQANTTNHTLGGT-SSTPVVTSPPKNATSAV-TTGQHNITSSSTSSMsLRPSSIS 652
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568926927   499 PALPPLGVGNPRTMPPTRDSAlTPAGSKKFTGRETSKKTRQKSSPRKPEPlSPGKSARDASPRD--LTTKP 567
Cdd:pfam05109  653 ETLSPSTSDNSTSHMPLLTSA-HPTGGENITQVTPASTSTHHVSTSSPAP-RPGTTSQASGPGNssTSTKP 721
 
Name Accession Description Interval E-value
COLFI pfam01410
Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1647-1845 1.71e-55

Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 460199  Cd Length: 233  Bit Score: 193.33  E-value: 1.71e-55
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  1647 EIFKTLHYLSNLIQSIKTPLGTKENPARVCRDLMDCEQRMADGTYWVDPNLGCSSDTIEVSCNFTQGgQTCLKPITAS-- 1724
Cdd:pfam01410    4 EVMATLKSLSQQIENIRSPDGSKKNPARTCRDLKLCHPDWKSGEYWIDPNQGCTRDAIKVFCNFETG-ETCIYPTKASip 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  1725 --------------------KAEF---------AVSRVQMNFLHLLSSEGTQHITIHCLNMTVWQEGPGrSSARQAVRFR 1775
Cdd:pfam01410   83 rknwwtkeskhvwfgefmngGSQFsygvdgvgpSVAAVQLTFLRLLSTEASQNITYHCKNSVAYMDQAT-GNLKKALLLQ 161
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1776 AWNGQVFEAGG--QFRPEVSMDGCKVHDGRWHQTLFTFRTQDPQQLPIVsvdNLPPVSSGK---QYRLEVGPACF 1845
Cdd:pfam01410  162 GSNDEEIRAEGnsRFTYTVLEDGCTKRTGQWGKTVIEYRTQKVSRLPIV---DIAPMDIGGadqEFGVEVGPVCF 233
COLFI smart00038
Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1646-1846 4.03e-52

Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 197483  Cd Length: 232  Bit Score: 183.44  E-value: 4.03e-52
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   1646 GEIFKTLHYLSNLIQSIKTPLGTKENPARVCRDLMDCEQRMADGTYWVDPNLGCSSDTIEVSCNFTqGGQTCLKPITASK 1725
Cdd:smart00038    2 EEVFASLKSLNNQIEQLKSPTGSRKNPARTCKDLKLCHPEWKSGEYWVDPNQGCIRDAIKVFCNFE-TGETCVSPSPSSI 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   1726 A----------------------EFA--------VSRVQMNFLHLLSSEGTQHITIHCLNMTVWQEgPGRSSARQAVRFR 1775
Cdd:smart00038   81 PrktwysgkskhvwfgetmnggfKFSygdsegppVGVVQLTFLRLLSTEAHQNITYHCKNSVAYMD-EATGNLKKALRLR 159
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568926927   1776 AWNGQVFEAGGQFRP--EVSMDGCKVHDGRWHQTLFTFRTQDPQQLPIVsvdNLPPVSSGKQYR---LEVGPACFL 1846
Cdd:smart00038  160 GSNDVELSAEGNSKFtyEVLEDGCQKRTGKWGKTVIEYRTKKTERLPIV---DIAPSDIGGPDQefgVEIGPVCFS 232
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1354-1584 5.44e-35

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 140.43  E-value: 5.44e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1354 EGVQGLRGEpGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPegtagsdgipg 1433
Cdd:NF038329  108 EGLQQLKGD-GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGP----------- 175
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1434 rdgrPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGPPGKRGtEGGTGLPGNQGEPGSKGQP 1513
Cdd:NF038329  176 ----AGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQ 250
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568926927 1514 GDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGPPGLMGKEGIIGPPGMLGPSGLPGPKGDRGSRGDLGL 1584
Cdd:NF038329  251 GPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQ 321
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1168-1455 1.14e-34

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 139.27  E-value: 1.14e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1168 GQRGEPGLEgdhGPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGAKGH 1247
Cdd:NF038329  117 GEKGEPGPA---GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGP 193
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1248 QGylgemgipgepgppgtpgpkgSRGTLGPTGAPGRMGAQGEPGLAGYNGhkgitgplgppgpkgEKGDQGEDGktegpp 1327
Cdd:NF038329  194 QG---------------------PRGETGPAGEQGPAGPAGPDGEAGPAG---------------EDGPAGPAG------ 231
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1328 gppgdrgpvgdRGDRGEPGDPGYPGQEGVQGLRGEPGQQGQpghPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGL 1407
Cdd:NF038329  232 -----------DGQQGPDGDPGPTGEDGPQGPDGPAGKDGP---RGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGL 297
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*...
gi 568926927 1408 QGLPGPRGVVGRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGP 1455
Cdd:NF038329  298 PGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPKTP 345
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1313-1548 1.36e-34

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 139.27  E-value: 1.36e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1313 EKGDQGEDGktegPPGPPGDRGPVGDRGDRGEPGDPGYPGQEGVQGLRGEPGQQGQPGHPGPRGRPGPKGSKGEEGPKGK 1392
Cdd:NF038329  118 EKGEPGPAG----PAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGP 193
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1393 PGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDGiPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQG 1472
Cdd:NF038329  194 QGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDG 272
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568926927 1473 PPGFKGESGLPgqlGPPGKRGTEGGTGLPGNQGEPGSKGQPGDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGP 1548
Cdd:NF038329  273 PDGKDGERGPV---GPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPKTP 345
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
959-1234 1.54e-33

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 136.19  E-value: 1.54e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  959 LDGSKGEPGDPGRPGPVGEQGLMGFIGLVGEPGIVGEKGDRGvmgppgapgpkgsmghpgtpggignpgepgpwgPPGSR 1038
Cdd:NF038329  115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQG---------------------------------ERGEK 161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1039 GLPGMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSGAKGflgipgpsgppgAKGLPGEPGs 1118
Cdd:NF038329  162 GPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAG------------PAGEDGPAG- 228
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1119 qgpqgpVGPPGEMGPKGPPGAVGEPGLPGDSGMKGDLGPLGPPGEQGLIGQRGEPGLEGDHGPVGPDGLKGDRGDPGPDG 1198
Cdd:NF038329  229 ------PAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDG 302
                         250       260       270
                  ....*....|....*....|....*....|....*.
gi 568926927 1199 EHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQG 1234
Cdd:NF038329  303 KDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1033-1249 9.01e-31

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 127.71  E-value: 9.01e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1033 GPPGSRGLPGMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSGAKGFLGIPGPSGPPGAKGL 1112
Cdd:NF038329  126 GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGE 205
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1113 PGEPGSQGPQGPVGPPGEMGPKGPPGA--VGEPGLPGDSGMKGDLGPLGPPGEQGLIGQRGEPglegdhGPVGPDGLKGD 1190
Cdd:NF038329  206 QGPAGPAGPDGEAGPAGEDGPAGPAGDgqQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEA------GPDGPDGKDGE 279
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 568926927 1191 RGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGAKGHQG 1249
Cdd:NF038329  280 RGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
728-974 1.90e-28

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 120.78  E-value: 1.90e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  728 FPGPVGDPGPKGSRGYIGLPGLFGLPGSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPPGVLGLI 807
Cdd:NF038329  115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQ 194
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  808 GDTGALGPVGYPGPKGMKGLMGGVGEPGLKGDKGEQGvpgvSGDPGFQGDKGSHGLPGLPGGRGKPGPLGKAGDKGslgf 887
Cdd:NF038329  195 GPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG----DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRG---- 266
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  888 pgppgPEGFPGDIGPPGDNGPEGMKGKPGARGLPGPPGQLGPEGDEgpmgppGVPGLEGQPGRKGFPGRPGLDGSKGEPG 967
Cdd:NF038329  267 -----EAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQN------GKDGLPGKDGKDGQPGKDGLPGKDGKDG 335

                  ....*..
gi 568926927  968 DPGRPGP 974
Cdd:NF038329  336 QPGKPAP 342
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
640-859 6.48e-25

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 109.99  E-value: 6.48e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  640 GPPGPYGNPGPpgppgakgqKGDPGLSPGQAHDGAKGNMGLPGLSGNPGPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGA 719
Cdd:NF038329  126 GPAGPAGEQGP---------RGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  720 KGYPGRQGFPGPVGDPGPKGSRGYIGLPGLFGLPGS--DGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGL 797
Cdd:NF038329  197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDgqQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGK 276
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568926927  798 PGPPGVLGLIGDTGALGPVGYPGPKGMKglmGGVGEPGLKGDKGEQGVPGVSGDPGFQGDKG 859
Cdd:NF038329  277 DGERGPVGPAGKDGQNGKDGLPGKDGKD---GQNGKDGLPGKDGKDGQPGKDGLPGKDGKDG 335
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
808-1093 1.43e-22

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 103.06  E-value: 1.43e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  808 GDTGALGPVGYPGPKGMKGLMGGVGEPGLKGDKGEQGVPGVSGDPGFQGDKGshglpglpggrgKPGPLGKAGDKgslgf 887
Cdd:NF038329  117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQG------------EAGPQGPAGKD----- 179
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  888 pgppgpegfpgdiGPPGDNGPEGMKGKPGARGLPGPPGQLGPEGDEGPMGPPGVPGLEGQPGRKGfPGRPGLDGSKGEPG 967
Cdd:NF038329  180 -------------GEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTG 245
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  968 DPGRPGPVGEQGLMGFIGLVGEPGIVGEKGDRGvmgppgapgpkgsmghpgtpggignpgepgpwgPPGSRGLPGMRGAK 1047
Cdd:NF038329  246 EDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDG---------------------------------ERGPVGPAGKDGQN 292
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*.
gi 568926927 1048 GHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSG 1093
Cdd:NF038329  293 GKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
610-822 4.48e-22

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 101.52  E-value: 4.48e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  610 GPPGSKGDCGLPGPPGLPGLPGSPGARGPRGPPGPYGNPGPPGPPGAKGQKGDPGLSPGQAHDGAKGNMGLPGLSGNPGP 689
Cdd:NF038329  123 GPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGP 202
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  690 LGRKGHKGHPGAAGHPGEQGQPGPEGSPGA--------KGYPGRQGFPGPVGDPGPKGSRGYIGLPGLFGLPGSDGERGL 761
Cdd:NF038329  203 AGEQGPAGPAGPDGEAGPAGEDGPAGPAGDgqqgpdgdPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGP 282
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568926927  762 PGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPPGVLGLIGDTGALGPVGYPGPK 822
Cdd:NF038329  283 VGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPK 343
TSPN smart00210
Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of ...
43-221 2.33e-13

Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of thrombospondin


Pssm-ID: 214560  Cd Length: 184  Bit Score: 70.46  E-value: 2.33e-13
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927     43 DVDVLQRLGL-----SWTKAGGGRSPTPpgvipfpsGFIFTQRAKLQAPTANVLPTTLGRELALVLSLCSHRVNHAFLFA 117
Cdd:smart00210    1 GQDLLQVFDLpslsfAIRQVVGPEPGSP--------AYRLGDPALVPQPTRDLFPSGLPEDFSLLTTFRQTPKSRGVLFA 72
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927    118 IRSRKHKLQLGLQFLPGRTIIHL------GPRQSVAF-DLDVHDGRWHHLALELRGRTVTMVTACGQHRVpVPLPSRRDS 190
Cdd:smart00210   73 IYDAQNVRQFGLEVDGRANTLLLryqgvdGKQHTVSFrNLPLADGQWHKLALSVSGSSATLYVDCNEIDS-RPLDRPGQP 151
                           170       180       190
                    ....*....|....*....|....*....|.
gi 568926927    191 MLDPQGSFLLGKVNPRAVQFEGALCQFSIHP 221
Cdd:smart00210  152 PIDTDGIEVRGAQAADRKPFQGDLQQLKIVC 182
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1485-1585 2.49e-11

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 68.01  E-value: 2.49e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1485 QLGPPGKRGTEGGTGLPGNQGEPGSKGQPGDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGPPGLMGKEGIIGPPGML 1564
Cdd:NF038329  112 QLKGDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEK 191
                          90       100
                  ....*....|....*....|.
gi 568926927 1565 GPSGLPGPKGDRGSRGDLGLQ 1585
Cdd:NF038329  192 GPQGPRGETGPAGEQGPAGPA 212
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
269-567 2.28e-07

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 56.08  E-value: 2.28e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   269 GLGNLTRTPATLGArPVSRALAVTLAPAMPTKPLRTvhpdvsehSSSQTPLSPAKQSARKTPSPSSSASLANSTRVYRP- 347
Cdd:pfam05109  447 GLPSSTHVPTNLTA-PASTGPTVSTADVTSPTPAGT--------TSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPt 517
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   348 --AAAQPRQITTTSPTKRSPTKPSVSPLSVTPMKSPHATQKTgvPSFTKPVP----PT-QKPAPFTSYLAPS-KASSPTV 419
Cdd:pfam05109  518 pnATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPT--PAVTTPTPnatiPTlGKTSPTSAVTTPTpNATSPTV 595
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   420 rpvqktfmtprppvpspqplrPTTGLSKKFTNPTVAKSkSKTTSWASKPVLARSSVpKTLQQTVLSQSPVSY-LGSQTLA 498
Cdd:pfam05109  596 ---------------------GETSPQANTTNHTLGGT-SSTPVVTSPPKNATSAV-TTGQHNITSSSTSSMsLRPSSIS 652
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568926927   499 PALPPLGVGNPRTMPPTRDSAlTPAGSKKFTGRETSKKTRQKSSPRKPEPlSPGKSARDASPRD--LTTKP 567
Cdd:pfam05109  653 ETLSPSTSDNSTSHMPLLTSA-HPTGGENITQVTPASTSTHHVSTSSPAP-RPGTTSQASGPGNssTSTKP 721
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
1409-1583 3.60e-06

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 51.57  E-value: 3.60e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1409 GLPGPRGVVGRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGP 1488
Cdd:COG5164    10 GPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQNQGG 89
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1489 PGKRGTEGGTGLPGNQGEPGSKGQPGDSGEMGFPGVAGLF--GPKGPPGDIGfkgiQGPRGpPGLMGKEGIIGPPGMLGP 1566
Cdd:COG5164    90 TRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPsgGSTTPPGDGG----STPPG-PGSTGPGGSTTPPGDGGS 164
                         170
                  ....*....|....*..
gi 568926927 1567 SGLPGPKGDRGSRGDLG 1583
Cdd:COG5164   165 TTPPGPGGSTTPPDDGG 181
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
745-801 1.93e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.64  E-value: 1.93e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 568926927   745 GLPGLFGLPGSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPP 801
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1042-1096 2.67e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.25  E-value: 2.67e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1042 GMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSGAKG 1096
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
682-737 3.88e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.87  E-value: 3.88e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 568926927   682 GLSGNPGPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGAKGYPGRQGFPGPVGDPGP 737
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
1177-1476 4.22e-05

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 48.10  E-value: 4.22e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1177 GDHGPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGAKGHQGYLGEMGI 1256
Cdd:COG5164     7 GKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQN 86
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1257 PGEPGPPGTPGPKGSRGTLGPTGAPGRMGAQGEPGLAGYNGHKGiTGPLGPPgpkgekgdqGEDGKTEGPPGPPGDRGPV 1336
Cdd:COG5164    87 QGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPS-GGSTTPP---------GDGGSTPPGPGSTGPGGST 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1337 GDRGDRGEPGDPGYPGQEGVQGLRGE--PGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGtqGLQGLPGPR 1414
Cdd:COG5164   157 TPPGDGGSTTPPGPGGSTTPPDDGGSttPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRG--GKTGPKDQR 234
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568926927 1415 GVVGRQGPEGTAGSDGIPGRDGRPGYQGDQ--GNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGF 1476
Cdd:COG5164   235 PKTNPIERRGPERPEAAALPAELTALEAENraANPEPATKTIPETTTVKDLATVLGKKGSDLVT 298
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
1340-1553 4.64e-05

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 48.47  E-value: 4.64e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  1340 GDRGEPGDPGYPGQEGVQGLRGEPGQQGQPGHPGPrgrpgpkgskgEEGPKGKPGKAGPSGRRGTQGLQGlpGPRGVVGR 1419
Cdd:pfam09606  178 GGPGQGQAGGMNGGQQGPMGGQMPPQMGVPGMPGP-----------ADAGAQMGQQAQANGGMNPQQMGG--APNQVAMQ 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  1420 QGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAglPGAQGPPGFKGESGLPGQLGPPGKRGTEGGTG 1499
Cdd:pfam09606  245 QQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPPGQQ--PGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNH 322
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1500 LPGNQGEPGSKGQPGdsGEMGFPGVAGLFGPkGPPGDIGFKGIQG-PRGPPGLMG 1553
Cdd:pfam09606  323 PAAHQQQMNQSVGQG--GQVVALGGLNHLET-WNPGNFGGLGANPmQRGQPGMMS 374
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
699-932 6.10e-05

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 47.72  E-value: 6.10e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  699 PGAAGHPGEQGQPGPEGSPGAKGYPGRQGFPGPVGDPGPKGSrgyiglpglfglPGSDGERGLPGVPGKRGEMGRPGFPG 778
Cdd:COG5164     6 PGKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRP------------AQNQGSTTPAGNTGGTRPAGNQGATG 73
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  779 DFGERGPPGLDGNPGEIGLPGPPGVLGLIGDTGALGPVGYPGPKGMKGLMGGVGEPGLKGDKGEQGVPGVSGDPGFQGDK 858
Cdd:COG5164    74 PAQNQGGTTPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPG 153
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568926927  859 GSHGLPGLPGGRGKPGPLGKAGDKGSlgfpGPPGPEGFPGDIGPPGDNGPEGMKGKPGARGLPGPPGQLGPEGD 932
Cdd:COG5164   154 GSTTPPGDGGSTTPPGPGGSTTPPDD----GGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDD 223
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
318-577 1.08e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 47.23  E-value: 1.08e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  318 PLSPAKQSARKTPSPSSSASLANSTRVYRPAAAQP-RQITTTSPTKRSPTKP---SVSPLS-------VTPMKSPHATQK 386
Cdd:PLN03209  312 PLTPMEELLAKIPSQRVPPKESDAADGPKPVPTKPvTPEAPSPPIEEEPPQPkavVPRPLSpytayedLKPPTSPIPTPP 391
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  387 TGVPSFTKPVPPTQKPAPFTSYLAPSKASS---PTVRPVQKTFMTPRPPVPSPQPLRPTTGLSKK---FTNPTVAKSKSK 460
Cdd:PLN03209  392 SSSPASSKSVDAVAKPAEPDVVPSPGSASNvpeVEPAQVEAKKTRPLSPYARYEDLKPPTSPSPTaptGVSPSVSSTSSV 471
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  461 TTSWASKPVLARSSvpKTLQQTVLSQSPVSYLGSQTLAPALPPLGVGNPRTMPPTRDSALTPAGSkkfTGRETSKKTRQK 540
Cdd:PLN03209  472 PAVPDTAPATAATD--AAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGN---SAPPTALADEQH 546
                         250       260       270
                  ....*....|....*....|....*....|....*..
gi 568926927  541 SSPRKPEPLSPGKSARDASPrdlttkPSRPsTPALVL 577
Cdd:PLN03209  547 HAQPKPRPLSPYTMYEDLKP------PTSP-TPSPVL 576
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
872-1087 1.66e-04

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 46.18  E-value: 1.66e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  872 KPGPLGKAGDKGSLGFPGPPGPEGFPGDIGPPGDNGPEGMKGKPGARGLPGPPGQLGPEGDEGPMGPPGVPGLEGQPGRK 951
Cdd:COG5164     8 KTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQNQ 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  952 GFPGRPGLDGSKGEPGDPGRPGPVGEQGLMGFIGLVG-----EPGIVGEKGDRGVMGPPGAPGPKGSMGHPGTPGGIGNP 1026
Cdd:COG5164    88 GGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGsttppSGGSTTPPGDGGSTPPGPGSTGPGGSTTPPGDGGSTTP 167
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568926927 1027 GEPGPWGPPGSRGLPGMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAG 1087
Cdd:COG5164   168 PGPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKT 228
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1180-1235 2.13e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.94  E-value: 2.13e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 568926927  1180 GPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGE 1235
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
124-205 7.57e-04

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 42.02  E-value: 7.57e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  124 KLQLGLQFLPGRTIIHLGPRqsvafdldVHDGRWHHLALELRGRTVTMVTACGQHrVPVPLPsRRDSMLDPQGSFLLGKV 203
Cdd:cd00110    57 RLVLRYDLGSGSLVLSSKTP--------LNDGQWHSVSVERNGRSVTLSVDGERV-VESGSP-GGSALLNLDGPLYLGGL 126

                  ..
gi 568926927  204 NP 205
Cdd:cd00110   127 PE 128
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1361-1550 7.71e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.59  E-value: 7.71e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1361 GEPGQQGQPGHPGPRGRPGPKGS---KGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDGIPGRDGR 1437
Cdd:PRK07764  590 PAPGAAGGEGPPAPASSGPPEEAarpAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGW 669
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1438 PGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGfkgesglpgqlgPPGKRGTEGGTGLPGNQGEPGSKGQPGDSG 1517
Cdd:PRK07764  670 PAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAA------------TPPAGQADDPAAQPPQAAQGASAPSPAADD 737
                         170       180       190
                  ....*....|....*....|....*....|....
gi 568926927 1518 EMGFPGVAGLF-GPKGPPGDIGFKGIQGPRGPPG 1550
Cdd:PRK07764  738 PVPLPPEPDDPpDPAGAPAQPPPPPAPAPAAAPA 771
PHA03169 PHA03169
hypothetical protein; Provisional
1074-1235 1.87e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 42.65  E-value: 1.87e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1074 RGRPGQPGQQGAAGErGHSGAKGFLGIPGPSGPPGAKGLPGEPGSqgpqgpvgPPGEMGPKGPPGAVGEPGLPGDsGMKG 1153
Cdd:PHA03169   81 HGEKEERGQGGPSGS-GSESVGSPTPSPSGSAEELASGLSPENTS--------GSSPESPASHSPPPSPPSHPGP-HEPA 150
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1154 DLGPLGPPGEQGLIGQRGEPGLEGDHGPVGPDGLKGDRGDPGPDGEHGEKGQEGLKG--EDGSPGPPGITGVPGREGKPG 1231
Cdd:PHA03169  151 PPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPpdEPGEPQSPTPQQAPSPNTQQA 230

                  ....
gi 568926927 1232 KQGE 1235
Cdd:PHA03169  231 VEHE 234
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
124-206 3.21e-03

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 39.33  E-value: 3.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   124 KLQLGLQFLPGRTIIHLGPRQsvafdldVHDGRWHHLALELRGRTVTMVTaCGQHRVPVPLPSRRDsMLDPQGSFLLGKV 203
Cdd:pfam02210   29 RLVLRYDLGSGPESLLSSGKN-------LNDGQWHSVRVERNGNTLTLSV-DGQTVVSSLPPGESL-LLNLNGPLYLGGL 99

                   ...
gi 568926927   204 NPR 206
Cdd:pfam02210  100 PPL 102
 
Name Accession Description Interval E-value
COLFI pfam01410
Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1647-1845 1.71e-55

Fibrillar collagen C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1 alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 460199  Cd Length: 233  Bit Score: 193.33  E-value: 1.71e-55
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  1647 EIFKTLHYLSNLIQSIKTPLGTKENPARVCRDLMDCEQRMADGTYWVDPNLGCSSDTIEVSCNFTQGgQTCLKPITAS-- 1724
Cdd:pfam01410    4 EVMATLKSLSQQIENIRSPDGSKKNPARTCRDLKLCHPDWKSGEYWIDPNQGCTRDAIKVFCNFETG-ETCIYPTKASip 82
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  1725 --------------------KAEF---------AVSRVQMNFLHLLSSEGTQHITIHCLNMTVWQEGPGrSSARQAVRFR 1775
Cdd:pfam01410   83 rknwwtkeskhvwfgefmngGSQFsygvdgvgpSVAAVQLTFLRLLSTEASQNITYHCKNSVAYMDQAT-GNLKKALLLQ 161
                          170       180       190       200       210       220       230
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1776 AWNGQVFEAGG--QFRPEVSMDGCKVHDGRWHQTLFTFRTQDPQQLPIVsvdNLPPVSSGK---QYRLEVGPACF 1845
Cdd:pfam01410  162 GSNDEEIRAEGnsRFTYTVLEDGCTKRTGQWGKTVIEYRTQKVSRLPIV---DIAPMDIGGadqEFGVEVGPVCF 233
COLFI smart00038
Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia ...
1646-1846 4.03e-52

Fibrillar collagens C-terminal domain; Found at C-termini of fibrillar collagens: Ephydatia muelleri procollagen EMF1alpha, vertebrate collagens alpha(1)III, alpha(1)II, alpha(2)V etc.


Pssm-ID: 197483  Cd Length: 232  Bit Score: 183.44  E-value: 4.03e-52
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   1646 GEIFKTLHYLSNLIQSIKTPLGTKENPARVCRDLMDCEQRMADGTYWVDPNLGCSSDTIEVSCNFTqGGQTCLKPITASK 1725
Cdd:smart00038    2 EEVFASLKSLNNQIEQLKSPTGSRKNPARTCKDLKLCHPEWKSGEYWVDPNQGCIRDAIKVFCNFE-TGETCVSPSPSSI 80
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   1726 A----------------------EFA--------VSRVQMNFLHLLSSEGTQHITIHCLNMTVWQEgPGRSSARQAVRFR 1775
Cdd:smart00038   81 PrktwysgkskhvwfgetmnggfKFSygdsegppVGVVQLTFLRLLSTEAHQNITYHCKNSVAYMD-EATGNLKKALRLR 159
                           170       180       190       200       210       220       230
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568926927   1776 AWNGQVFEAGGQFRP--EVSMDGCKVHDGRWHQTLFTFRTQDPQQLPIVsvdNLPPVSSGKQYR---LEVGPACFL 1846
Cdd:smart00038  160 GSNDVELSAEGNSKFtyEVLEDGCQKRTGKWGKTVIEYRTKKTERLPIV---DIAPSDIGGPDQefgVEIGPVCFS 232
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1354-1584 5.44e-35

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 140.43  E-value: 5.44e-35
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1354 EGVQGLRGEpGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPegtagsdgipg 1433
Cdd:NF038329  108 EGLQQLKGD-GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGP----------- 175
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1434 rdgrPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGPPGKRGtEGGTGLPGNQGEPGSKGQP 1513
Cdd:NF038329  176 ----AGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQ 250
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568926927 1514 GDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGPPGLMGKEGIIGPPGMLGPSGLPGPKGDRGSRGDLGL 1584
Cdd:NF038329  251 GPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQ 321
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1168-1455 1.14e-34

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 139.27  E-value: 1.14e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1168 GQRGEPGLEgdhGPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGAKGH 1247
Cdd:NF038329  117 GEKGEPGPA---GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGP 193
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1248 QGylgemgipgepgppgtpgpkgSRGTLGPTGAPGRMGAQGEPGLAGYNGhkgitgplgppgpkgEKGDQGEDGktegpp 1327
Cdd:NF038329  194 QG---------------------PRGETGPAGEQGPAGPAGPDGEAGPAG---------------EDGPAGPAG------ 231
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1328 gppgdrgpvgdRGDRGEPGDPGYPGQEGVQGLRGEPGQQGQpghPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGL 1407
Cdd:NF038329  232 -----------DGQQGPDGDPGPTGEDGPQGPDGPAGKDGP---RGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGL 297
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*...
gi 568926927 1408 QGLPGPRGVVGRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGP 1455
Cdd:NF038329  298 PGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPKTP 345
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1313-1548 1.36e-34

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 139.27  E-value: 1.36e-34
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1313 EKGDQGEDGktegPPGPPGDRGPVGDRGDRGEPGDPGYPGQEGVQGLRGEPGQQGQPGHPGPRGRPGPKGSKGEEGPKGK 1392
Cdd:NF038329  118 EKGEPGPAG----PAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGP 193
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1393 PGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDGiPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQG 1472
Cdd:NF038329  194 QGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDG 272
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*.
gi 568926927 1473 PPGFKGESGLPgqlGPPGKRGTEGGTGLPGNQGEPGSKGQPGDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGP 1548
Cdd:NF038329  273 PDGKDGERGPV---GPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPKTP 345
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
959-1234 1.54e-33

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 136.19  E-value: 1.54e-33
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  959 LDGSKGEPGDPGRPGPVGEQGLMGFIGLVGEPGIVGEKGDRGvmgppgapgpkgsmghpgtpggignpgepgpwgPPGSR 1038
Cdd:NF038329  115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQG---------------------------------ERGEK 161
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1039 GLPGMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSGAKGflgipgpsgppgAKGLPGEPGs 1118
Cdd:NF038329  162 GPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAG------------PAGEDGPAG- 228
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1119 qgpqgpVGPPGEMGPKGPPGAVGEPGLPGDSGMKGDLGPLGPPGEQGLIGQRGEPGLEGDHGPVGPDGLKGDRGDPGPDG 1198
Cdd:NF038329  229 ------PAGDGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDG 302
                         250       260       270
                  ....*....|....*....|....*....|....*.
gi 568926927 1199 EHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQG 1234
Cdd:NF038329  303 KDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1033-1249 9.01e-31

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 127.71  E-value: 9.01e-31
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1033 GPPGSRGLPGMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSGAKGFLGIPGPSGPPGAKGL 1112
Cdd:NF038329  126 GPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGPAGE 205
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1113 PGEPGSQGPQGPVGPPGEMGPKGPPGA--VGEPGLPGDSGMKGDLGPLGPPGEQGLIGQRGEPglegdhGPVGPDGLKGD 1190
Cdd:NF038329  206 QGPAGPAGPDGEAGPAGEDGPAGPAGDgqQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEA------GPDGPDGKDGE 279
                         170       180       190       200       210
                  ....*....|....*....|....*....|....*....|....*....|....*....
gi 568926927 1191 RGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGAKGHQG 1249
Cdd:NF038329  280 RGPVGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
728-974 1.90e-28

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 120.78  E-value: 1.90e-28
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  728 FPGPVGDPGPKGSRGYIGLPGLFGLPGSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPPGVLGLI 807
Cdd:NF038329  115 GDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQ 194
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  808 GDTGALGPVGYPGPKGMKGLMGGVGEPGLKGDKGEQGvpgvSGDPGFQGDKGSHGLPGLPGGRGKPGPLGKAGDKGslgf 887
Cdd:NF038329  195 GPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG----DGQQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRG---- 266
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  888 pgppgPEGFPGDIGPPGDNGPEGMKGKPGARGLPGPPGQLGPEGDEgpmgppGVPGLEGQPGRKGFPGRPGLDGSKGEPG 967
Cdd:NF038329  267 -----EAGPDGPDGKDGERGPVGPAGKDGQNGKDGLPGKDGKDGQN------GKDGLPGKDGKDGQPGKDGLPGKDGKDG 335

                  ....*..
gi 568926927  968 DPGRPGP 974
Cdd:NF038329  336 QPGKPAP 342
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
640-859 6.48e-25

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 109.99  E-value: 6.48e-25
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  640 GPPGPYGNPGPpgppgakgqKGDPGLSPGQAHDGAKGNMGLPGLSGNPGPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGA 719
Cdd:NF038329  126 GPAGPAGEQGP---------RGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGP 196
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  720 KGYPGRQGFPGPVGDPGPKGSRGYIGLPGLFGLPGS--DGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGL 797
Cdd:NF038329  197 RGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAGDgqQGPDGDPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGK 276
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568926927  798 PGPPGVLGLIGDTGALGPVGYPGPKGMKglmGGVGEPGLKGDKGEQGVPGVSGDPGFQGDKG 859
Cdd:NF038329  277 DGERGPVGPAGKDGQNGKDGLPGKDGKD---GQNGKDGLPGKDGKDGQPGKDGLPGKDGKDG 335
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
808-1093 1.43e-22

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 103.06  E-value: 1.43e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  808 GDTGALGPVGYPGPKGMKGLMGGVGEPGLKGDKGEQGVPGVSGDPGFQGDKGshglpglpggrgKPGPLGKAGDKgslgf 887
Cdd:NF038329  117 GEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQG------------EAGPQGPAGKD----- 179
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  888 pgppgpegfpgdiGPPGDNGPEGMKGKPGARGLPGPPGQLGPEGDEGPMGPPGVPGLEGQPGRKGfPGRPGLDGSKGEPG 967
Cdd:NF038329  180 -------------GEAGAKGPAGEKGPQGPRGETGPAGEQGPAGPAGPDGEAGPAGEDGPAGPAG-DGQQGPDGDPGPTG 245
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  968 DPGRPGPVGEQGLMGFIGLVGEPGIVGEKGDRGvmgppgapgpkgsmghpgtpggignpgepgpwgPPGSRGLPGMRGAK 1047
Cdd:NF038329  246 EDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDG---------------------------------ERGPVGPAGKDGQN 292
                         250       260       270       280
                  ....*....|....*....|....*....|....*....|....*.
gi 568926927 1048 GHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSG 1093
Cdd:NF038329  293 GKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPG 338
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
610-822 4.48e-22

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 101.52  E-value: 4.48e-22
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  610 GPPGSKGDCGLPGPPGLPGLPGSPGARGPRGPPGPYGNPGPPGPPGAKGQKGDPGLSPGQAHDGAKGNMGLPGLSGNPGP 689
Cdd:NF038329  123 GPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEKGPQGPRGETGP 202
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  690 LGRKGHKGHPGAAGHPGEQGQPGPEGSPGA--------KGYPGRQGFPGPVGDPGPKGSRGYIGLPGLFGLPGSDGERGL 761
Cdd:NF038329  203 AGEQGPAGPAGPDGEAGPAGEDGPAGPAGDgqqgpdgdPGPTGEDGPQGPDGPAGKDGPRGDRGEAGPDGPDGKDGERGP 282
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568926927  762 PGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPPGVLGLIGDTGALGPVGYPGPK 822
Cdd:NF038329  283 VGPAGKDGQNGKDGLPGKDGKDGQNGKDGLPGKDGKDGQPGKDGLPGKDGKDGQPGKPAPK 343
TSPN smart00210
Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of ...
43-221 2.33e-13

Thrombospondin N-terminal -like domains; Heparin-binding and cell adhesion domain of thrombospondin


Pssm-ID: 214560  Cd Length: 184  Bit Score: 70.46  E-value: 2.33e-13
                            10        20        30        40        50        60        70        80
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927     43 DVDVLQRLGL-----SWTKAGGGRSPTPpgvipfpsGFIFTQRAKLQAPTANVLPTTLGRELALVLSLCSHRVNHAFLFA 117
Cdd:smart00210    1 GQDLLQVFDLpslsfAIRQVVGPEPGSP--------AYRLGDPALVPQPTRDLFPSGLPEDFSLLTTFRQTPKSRGVLFA 72
                            90       100       110       120       130       140       150       160
                    ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927    118 IRSRKHKLQLGLQFLPGRTIIHL------GPRQSVAF-DLDVHDGRWHHLALELRGRTVTMVTACGQHRVpVPLPSRRDS 190
Cdd:smart00210   73 IYDAQNVRQFGLEVDGRANTLLLryqgvdGKQHTVSFrNLPLADGQWHKLALSVSGSSATLYVDCNEIDS-RPLDRPGQP 151
                           170       180       190
                    ....*....|....*....|....*....|.
gi 568926927    191 MLDPQGSFLLGKVNPRAVQFEGALCQFSIHP 221
Cdd:smart00210  152 PIDTDGIEVRGAQAADRKPFQGDLQQLKIVC 182
gly_rich_SclB NF038329
LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like ...
1485-1585 2.49e-11

LPXTG-anchored collagen-like adhesin Scl2/SclB; SclB (or Scl2 - streptococcal collagen-like protein 2) is an LPXTG-anchored surface-anchored adhesin with a variable-length region of triple helix-forming collagen-like Gly-Xaa-Xaa repeats.


Pssm-ID: 468478 [Multi-domain]  Cd Length: 440  Bit Score: 68.01  E-value: 2.49e-11
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1485 QLGPPGKRGTEGGTGLPGNQGEPGSKGQPGDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGPPGLMGKEGIIGPPGML 1564
Cdd:NF038329  112 QLKGDGEKGEPGPAGPAGPAGEQGPRGDRGETGPAGPAGPPGPQGERGEKGPAGPQGEAGPQGPAGKDGEAGAKGPAGEK 191
                          90       100
                  ....*....|....*....|.
gi 568926927 1565 GPSGLPGPKGDRGSRGDLGLQ 1585
Cdd:NF038329  192 GPQGPRGETGPAGEQGPAGPA 212
Herpes_BLLF1 pfam05109
Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 ...
269-567 2.28e-07

Herpes virus major outer envelope glycoprotein (BLLF1); This family consists of the BLLF1 viral late glycoprotein, also termed gp350/220. It is the most abundantly expressed glycoprotein in the viral envelope of the Herpesviruses and is the major antigen responsible for stimulating the production of neutralising antibodies in vivo.


Pssm-ID: 282904 [Multi-domain]  Cd Length: 886  Bit Score: 56.08  E-value: 2.28e-07
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   269 GLGNLTRTPATLGArPVSRALAVTLAPAMPTKPLRTvhpdvsehSSSQTPLSPAKQSARKTPSPSSSASLANSTRVYRP- 347
Cdd:pfam05109  447 GLPSSTHVPTNLTA-PASTGPTVSTADVTSPTPAGT--------TSGASPVTPSPSPRDNGTESKAPDMTSPTSAVTTPt 517
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   348 --AAAQPRQITTTSPTKRSPTKPSVSPLSVTPMKSPHATQKTgvPSFTKPVP----PT-QKPAPFTSYLAPS-KASSPTV 419
Cdd:pfam05109  518 pnATSPTPAVTTPTPNATSPTLGKTSPTSAVTTPTPNATSPT--PAVTTPTPnatiPTlGKTSPTSAVTTPTpNATSPTV 595
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   420 rpvqktfmtprppvpspqplrPTTGLSKKFTNPTVAKSkSKTTSWASKPVLARSSVpKTLQQTVLSQSPVSY-LGSQTLA 498
Cdd:pfam05109  596 ---------------------GETSPQANTTNHTLGGT-SSTPVVTSPPKNATSAV-TTGQHNITSSSTSSMsLRPSSIS 652
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568926927   499 PALPPLGVGNPRTMPPTRDSAlTPAGSKKFTGRETSKKTRQKSSPRKPEPlSPGKSARDASPRD--LTTKP 567
Cdd:pfam05109  653 ETLSPSTSDNSTSHMPLLTSA-HPTGGENITQVTPASTSTHHVSTSSPAP-RPGTTSQASGPGNssTSTKP 721
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
1409-1583 3.60e-06

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 51.57  E-value: 3.60e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1409 GLPGPRGVVGRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGP 1488
Cdd:COG5164    10 GPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQNQGG 89
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1489 PGKRGTEGGTGLPGNQGEPGSKGQPGDSGEMGFPGVAGLF--GPKGPPGDIGfkgiQGPRGpPGLMGKEGIIGPPGMLGP 1566
Cdd:COG5164    90 TRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPsgGSTTPPGDGG----STPPG-PGSTGPGGSTTPPGDGGS 164
                         170
                  ....*....|....*..
gi 568926927 1567 SGLPGPKGDRGSRGDLG 1583
Cdd:COG5164   165 TTPPGPGGSTTPPDDGG 181
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
1427-1583 4.66e-06

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 51.57  E-value: 4.66e-06
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1427 GSDGIPGRDGRPGYQGDQGNDGDPGPVG---PAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGPPGKRGTEGGTGLPGN 1503
Cdd:COG5164     7 GKTGPSDPGGVTTPAGSQGSTKPAQNQGstrPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQN 86
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1504 qgePGSKGQPGDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGPP--GLMGKEGIIGP----PGMLGPSGLPGPKGDRG 1577
Cdd:COG5164    87 ---QGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPsgGSTTPPGDGGStppgPGSTGPGGSTTPPGDGG 163

                  ....*.
gi 568926927 1578 SRGDLG 1583
Cdd:COG5164   164 STTPPG 169
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
308-579 1.90e-05

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 49.57  E-value: 1.90e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   308 DVSEHSSSQTPLSPAKQSARKTPSPSSSASLANstrvyrpAAAQPRQITTTSPTKRSPTKP---SVSPLSVTPMKSPHAT 384
Cdd:pfam17823  108 DGAASRALAAAASSSPSSAAQSLPAAIAALPSE-------AFSAPRAAACRANASAAPRAAiaaASAPHAASPAPRTAAS 180
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   385 QKTGVPSFT-KPVPPTQKPAPFTSYLAP----SKASSPTVRPVQKTFMTP-RPPVPSPQPLRPTTGLSKKFTNPTVAKSK 458
Cdd:pfam17823  181 STTAASSTTaASSAPTTAASSAPATLTPargiSTAATATGHPAAGTALAAvGNSSPAAGTVTAAVGTVTPAALATLAAAA 260
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   459 SKTTSWA-----SKPVLARSSVPKTLQQTVLSQSPVSYLGSQTLAPAL-----PPLGVGNPRTMPPTRDSALTPAGSKKF 528
Cdd:pfam17823  261 GTVASAAgtinmGDPHARRLSPAKHMPSDTMARNPAAPMGAQAQGPIIqvstdQPVHNTAGEPTPSPSNTTLEPNTPKSV 340
                          250       260       270       280       290       300
                   ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568926927   529 TGRE----TSKKTRQKSSPRKPEPLSPGKSARDASPRDLTTKPS------RPSTPALVLAP 579
Cdd:pfam17823  341 ASTNlavvTTTKAQAKEPSASPVPVLHTSMIPEVEATSPTTQPSpllptqGAAGPGILLAP 401
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
745-801 1.93e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.64  E-value: 1.93e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 568926927   745 GLPGLFGLPGSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPP 801
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1042-1096 2.67e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 43.25  E-value: 2.67e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1042 GMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSGAKG 1096
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
682-737 3.88e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.87  E-value: 3.88e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 568926927   682 GLSGNPGPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGAKGYPGRQGFPGPVGDPGP 737
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
1177-1476 4.22e-05

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 48.10  E-value: 4.22e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1177 GDHGPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGAKGHQGYLGEMGI 1256
Cdd:COG5164     7 GKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQN 86
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1257 PGEPGPPGTPGPKGSRGTLGPTGAPGRMGAQGEPGLAGYNGHKGiTGPLGPPgpkgekgdqGEDGKTEGPPGPPGDRGPV 1336
Cdd:COG5164    87 QGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPS-GGSTTPP---------GDGGSTPPGPGSTGPGGST 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1337 GDRGDRGEPGDPGYPGQEGVQGLRGE--PGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGtqGLQGLPGPR 1414
Cdd:COG5164   157 TPPGDGGSTTPPGPGGSTTPPDDGGSttPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRG--GKTGPKDQR 234
                         250       260       270       280       290       300
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568926927 1415 GVVGRQGPEGTAGSDGIPGRDGRPGYQGDQ--GNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGF 1476
Cdd:COG5164   235 PKTNPIERRGPERPEAAALPAELTALEAENraANPEPATKTIPETTTVKDLATVLGKKGSDLVT 298
Med15 pfam09606
ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of ...
1340-1553 4.64e-05

ARC105 or Med15 subunit of Mediator complex non-fungal; The approx. 70 residue Med15 domain of the ARC-Mediator co-activator is a three-helix bundle with marked similarity to the KIX domain. The sterol regulatory element binding protein (SREBP) family of transcription activators use the ARC105 subunit to activate target genes in the regulation of cholesterol and fatty acid homeostasis. In addition, Med15 is a critical transducer of gene activation signals that control early metazoan development.


Pssm-ID: 312941 [Multi-domain]  Cd Length: 732  Bit Score: 48.47  E-value: 4.64e-05
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  1340 GDRGEPGDPGYPGQEGVQGLRGEPGQQGQPGHPGPrgrpgpkgskgEEGPKGKPGKAGPSGRRGTQGLQGlpGPRGVVGR 1419
Cdd:pfam09606  178 GGPGQGQAGGMNGGQQGPMGGQMPPQMGVPGMPGP-----------ADAGAQMGQQAQANGGMNPQQMGG--APNQVAMQ 244
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  1420 QGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAglPGAQGPPGFKGESGLPGQLGPPGKRGTEGGTG 1499
Cdd:pfam09606  245 QQQPQQQGQQSQLGMGINQMQQMPQGVGGGAGQGGPGQPMGPPGQQ--PGAMPNVMSIGDQNNYQQQQTRQQQQQQGGNH 322
                          170       180       190       200       210
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1500 LPGNQGEPGSKGQPGdsGEMGFPGVAGLFGPkGPPGDIGFKGIQG-PRGPPGLMG 1553
Cdd:pfam09606  323 PAAHQQQMNQSVGQG--GQVVALGGLNHLET-WNPGNFGGLGANPmQRGQPGMMS 374
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
699-932 6.10e-05

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 47.72  E-value: 6.10e-05
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  699 PGAAGHPGEQGQPGPEGSPGAKGYPGRQGFPGPVGDPGPKGSrgyiglpglfglPGSDGERGLPGVPGKRGEMGRPGFPG 778
Cdd:COG5164     6 PGKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRP------------AQNQGSTTPAGNTGGTRPAGNQGATG 73
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  779 DFGERGPPGLDGNPGEIGLPGPPGVLGLIGDTGALGPVGYPGPKGMKGLMGGVGEPGLKGDKGEQGVPGVSGDPGFQGDK 858
Cdd:COG5164    74 PAQNQGGTTPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPG 153
                         170       180       190       200       210       220       230
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568926927  859 GSHGLPGLPGGRGKPGPLGKAGDKGSlgfpGPPGPEGFPGDIGPPGDNGPEGMKGKPGARGLPGPPGQLGPEGD 932
Cdd:COG5164   154 GSTTPPGDGGSTTPPGPGGSTTPPDD----GGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDD 223
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1358-1414 6.41e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.10  E-value: 6.41e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 568926927  1358 GLRGEPGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPR 1414
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1346-1400 7.72e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 42.10  E-value: 7.72e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1346 GDPGYPGQEGVQGLRGEPGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSG 1400
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
712-768 9.68e-05

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.71  E-value: 9.68e-05
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 568926927   712 GPEGSPGAKGYPGRQGFPGPVGDPGPKGSRGYIGLPGLFGLPGSDGERGLPGVPGKR 768
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
PLN03209 PLN03209
translocon at the inner envelope of chloroplast subunit 62; Provisional
318-577 1.08e-04

translocon at the inner envelope of chloroplast subunit 62; Provisional


Pssm-ID: 178748 [Multi-domain]  Cd Length: 576  Bit Score: 47.23  E-value: 1.08e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  318 PLSPAKQSARKTPSPSSSASLANSTRVYRPAAAQP-RQITTTSPTKRSPTKP---SVSPLS-------VTPMKSPHATQK 386
Cdd:PLN03209  312 PLTPMEELLAKIPSQRVPPKESDAADGPKPVPTKPvTPEAPSPPIEEEPPQPkavVPRPLSpytayedLKPPTSPIPTPP 391
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  387 TGVPSFTKPVPPTQKPAPFTSYLAPSKASS---PTVRPVQKTFMTPRPPVPSPQPLRPTTGLSKK---FTNPTVAKSKSK 460
Cdd:PLN03209  392 SSSPASSKSVDAVAKPAEPDVVPSPGSASNvpeVEPAQVEAKKTRPLSPYARYEDLKPPTSPSPTaptGVSPSVSSTSSV 471
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  461 TTSWASKPVLARSSvpKTLQQTVLSQSPVSYLGSQTLAPALPPLGVGNPRTMPPTRDSALTPAGSkkfTGRETSKKTRQK 540
Cdd:PLN03209  472 PAVPDTAPATAATD--AAAPPPANMRPLSPYAVYDDLKPPTSPSPAAPVGKVAPSSTNEVVKVGN---SAPPTALADEQH 546
                         250       260       270
                  ....*....|....*....|....*....|....*..
gi 568926927  541 SSPRKPEPLSPGKSARDASPrdlttkPSRPsTPALVL 577
Cdd:PLN03209  547 HAQPKPRPLSPYTMYEDLKP------PTSP-TPSPVL 576
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1433-1489 1.09e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.71  E-value: 1.09e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 568926927  1433 GRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGPP 1489
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
676-732 1.23e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.33  E-value: 1.23e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 568926927   676 GNMGLPGLSGNPGPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGAKGYPGRQGFPGPV 732
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1361-1415 1.30e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.33  E-value: 1.30e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1361 GEPGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRG 1415
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1370-1424 1.41e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.33  E-value: 1.41e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1370 GHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEG 1424
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1367-1422 1.58e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.33  E-value: 1.58e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 568926927  1367 GQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGP 1422
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1421-1475 1.58e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 41.33  E-value: 1.58e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1421 GPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPG 1475
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
872-1087 1.66e-04

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 46.18  E-value: 1.66e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  872 KPGPLGKAGDKGSLGFPGPPGPEGFPGDIGPPGDNGPEGMKGKPGARGLPGPPGQLGPEGDEGPMGPPGVPGLEGQPGRK 951
Cdd:COG5164     8 KTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQNQ 87
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  952 GFPGRPGLDGSKGEPGDPGRPGPVGEQGLMGFIGLVG-----EPGIVGEKGDRGVMGPPGAPGPKGSMGHPGTPGGIGNP 1026
Cdd:COG5164    88 GGTRPAGNTGGTTPAGDGGATGPPDDGGATGPPDDGGsttppSGGSTTPPGDGGSTPPGPGSTGPGGSTTPPGDGGSTTP 167
                         170       180       190       200       210       220
                  ....*....|....*....|....*....|....*....|....*....|....*....|.
gi 568926927 1027 GEPGPWGPPGSRGLPGMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAG 1087
Cdd:COG5164   168 PGPGGSTTPPDDGGSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKT 228
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1448-1502 2.10e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.94  E-value: 2.10e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1448 GDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGPPGKRGTEGGTGLPG 1502
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1180-1235 2.13e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.94  E-value: 2.13e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 568926927  1180 GPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGE 1235
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
694-748 2.72e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.55  E-value: 2.72e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927   694 GHKGHPGAAGHPGEQGQPGPEGSPGAKGYPGRQGFPGPVGDPGPKGSRGYIGLPG 748
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1487-1541 2.94e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.55  E-value: 2.94e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1487 GPPGKRGTEGGTGLPGNQGEPGSKGQPGDSGEMGFPGVAGLFGPKGPPGDIGFKG 1541
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1466-1520 3.12e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.17  E-value: 3.12e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1466 GLPGAQGPPGFKGESGLPGQLGPPGKRGTEGGTGLPGNQGEPGSKGQPGDSGEMG 1520
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1460-1514 3.54e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 40.17  E-value: 3.54e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1460 GNPGVAGLPGAQGPPGFKGESGLPGQLGPPGKRGTEGGTGLPGNQGEPGSKGQPG 1514
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
966-1230 3.91e-04

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 45.02  E-value: 3.91e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  966 PGDPGRPGPVGEQGLMGFIGLVGEPGIVGEKGDRGVMGPPGAPGPKGSMGHPGTPGGignpgePGPWGPPGSRGLPGMRG 1045
Cdd:COG5164     6 PGKTGPSDPGGVTTPAGSQGSTKPAQNQGSTRPAGNTGGTRPAQNQGSTTPAGNTGG------TRPAGNQGATGPAQNQG 79
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1046 AKGHRGPRGPDGPAGEQGSKGlkgRVGPRGRPGQPGQQGAAGERGHSGAKGFLGIPGPSGPPGAKGLPGEPGSQGPQGPV 1125
Cdd:COG5164    80 GTTPAQNQGGTRPAGNTGGTT---PAGDGGATGPPDDGGATGPPDDGGSTTPPSGGSTTPPGDGGSTPPGPGSTGPGGST 156
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1126 GPPGEMGPKGPPGAVGEPGLPGD--SGMKGDLGPLGPPGEQGLIGQRGEPGLEGDHGPVGPDGLKGDR-GDPGPDGEHGE 1202
Cdd:COG5164   157 TPPGDGGSTTPPGPGGSTTPPDDggSTTPPNKGETGTDIPTGGTPRQGPDGPVKKDDKNGKGNPPDDRgGKTGPKDQRPK 236
                         250       260
                  ....*....|....*....|....*...
gi 568926927 1203 KGQEGLKGEDGSPGPPGITGVPGREGKP 1230
Cdd:COG5164   237 TNPIERRGPERPEAAALPAELTALEAEN 264
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
685-741 4.23e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.78  E-value: 4.23e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 568926927   685 GNPGPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGAKGYPGRQGFPGPVGDPGPKGSR 741
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1505-1561 6.40e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.40  E-value: 6.40e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 568926927  1505 GEPGSKGQPGDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGPPGLMGKEGIIGPP 1561
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
LamG cd00110
Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have ...
124-205 7.57e-04

Laminin G domain; Laminin G-like domains are usually Ca++ mediated receptors that can have binding sites for steroids, beta1 integrins, heparin, sulfatides, fibulin-1, and alpha-dystroglycans. Proteins that contain LamG domains serve a variety of purposes including signal transduction via cell-surface steroid receptors, adhesion, migration and differentiation through mediation of cell adhesion molecules.


Pssm-ID: 238058 [Multi-domain]  Cd Length: 151  Bit Score: 42.02  E-value: 7.57e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  124 KLQLGLQFLPGRTIIHLGPRqsvafdldVHDGRWHHLALELRGRTVTMVTACGQHrVPVPLPsRRDSMLDPQGSFLLGKV 203
Cdd:cd00110    57 RLVLRYDLGSGSLVLSSKTP--------LNDGQWHSVSVERNGRSVTLSVDGERV-VESGSP-GGSALLNLDGPLYLGGL 126

                  ..
gi 568926927  204 NP 205
Cdd:cd00110   127 PE 128
PHA03247 PHA03247
large tegument protein UL36; Provisional
294-573 7.67e-04

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 44.54  E-value: 7.67e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  294 APAMPTK-PLRTVHPDVSEHSSSQTPLSPAKQSARKTPSPSSSASLANSTRVYRP--AAAQPRQITTTSPT---KRSPTK 367
Cdd:PHA03247 2610 GPAPPSPlPPDTHAPDPPPPSPSPAANEPDPHPPPTVPPPERPRDDPAPGRVSRPrrARRLGRAAQASSPPqrpRRRAAR 2689
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  368 PSVSPLS---------VTPMKSPHATQkTGVPSFTKPV-------PPTQKPAPFTSYLAPSKASSPTVRPVQKTFMTPRP 431
Cdd:PHA03247 2690 PTVGSLTsladpppppPTPEPAPHALV-SATPLPPGPAaarqaspALPAAPAPPAVPAGPATPGGPARPARPPTTAGPPA 2768
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  432 PVPSPQplrPTTGLSKKFTNPTVAKSKSKTTSWASKPVLARSSVPKTLQQTVL--SQSPVSYL----GSQTLAPALPPLG 505
Cdd:PHA03247 2769 PAPPAA---PAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALppAASPAGPLppptSAQPTAPPPPPGP 2845
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  506 VGNPRTM-------------PPTRDSALTPAGSKKFTGRETSKKTRQKSS------PRKPEPLSPGKSARDASPRDLTTK 566
Cdd:PHA03247 2846 PPPSLPLggsvapggdvrrrPPSRSPAAKPAAPARPPVRRLARPAVSRSTesfalpPDQPERPPQPQAPPPPQPQPQPPP 2925

                  ....*..
gi 568926927  567 PSRPSTP 573
Cdd:PHA03247 2926 PPQPQPP 2932
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1361-1550 7.71e-04

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 44.59  E-value: 7.71e-04
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1361 GEPGQQGQPGHPGPRGRPGPKGS---KGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDGIPGRDGR 1437
Cdd:PRK07764  590 PAPGAAGGEGPPAPASSGPPEEAarpAAPAAPAAPAAPAPAGAAAAPAEASAAPAPGVAAPEHHPKHVAVPDASDGGDGW 669
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1438 PGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGfkgesglpgqlgPPGKRGTEGGTGLPGNQGEPGSKGQPGDSG 1517
Cdd:PRK07764  670 PAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAA------------TPPAGQADDPAAQPPQAAQGASAPSPAADD 737
                         170       180       190
                  ....*....|....*....|....*....|....
gi 568926927 1518 EMGFPGVAGLF-GPKGPPGDIGFKGIQGPRGPPG 1550
Cdd:PRK07764  738 PVPLPPEPDDPpDPAGAPAQPPPPPAPAPAAAPA 771
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
688-742 7.94e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.01  E-value: 7.94e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927   688 GPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGAKGYPGRQGFPGPVGDPGPKGSRG 742
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1339-1382 8.59e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.01  E-value: 8.59e-04
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....
gi 568926927  1339 RGDRGEPGDPGYPGQEGVQGLRGEPGQQGQPGHPGPRGRPGPKG 1382
Cdd:pfam01391   12 PGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1424-1479 8.85e-04

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.01  E-value: 8.85e-04
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 568926927  1424 GTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGE 1479
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
754-802 1.03e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.01  E-value: 1.03e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 568926927   754 GSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPPG 802
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPG 49
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1156-1218 1.06e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 39.01  E-value: 1.06e-03
                           10        20        30        40        50        60
                   ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568926927  1156 GPLGPPGEQGLIGQRGEPGlegdhgPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGEDGSPGPP 1218
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPG------PPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
PTZ00449 PTZ00449
104 kDa microneme/rhoptry antigen; Provisional
262-573 1.13e-03

104 kDa microneme/rhoptry antigen; Provisional


Pssm-ID: 185628 [Multi-domain]  Cd Length: 943  Bit Score: 43.91  E-value: 1.13e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  262 HPEPALLGLGNLTRTPATLGARPVSRALAVTLAPAMPTKPlrtVHP-DVSEHSSSQTPLSPAKQSARKTpspsssaslan 340
Cdd:PTZ00449  546 GGKPGETKEGEVGKKPGPAKEHKPSKIPTLSKKPEFPKDP---KHPkDPEEPKKPKRPRSAQRPTRPKS----------- 611
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  341 strvyrPAAAQPRQITTTSPTKRSPTKPSvSPLSVTPMKSPHATQKTGVPSFTKPvPPTQKPaPFtsylapskasSPTVR 420
Cdd:PTZ00449  612 ------PKLPELLDIPKSPKRPESPKSPK-RPPPPQRPSSPERPEGPKIIKSPKP-PKSPKP-PF----------DPKFK 672
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  421 pvqktfmtprppvpspqplrpttglSKKFTNPTVAKSKSKTTswasKPVLARSSVPKTLQQTVLSQSPVSYLGSQTLAPA 500
Cdd:PTZ00449  673 -------------------------EKFYDDYLDAAAKSKET----KTTVVLDESFESILKETLPETPGTPFTTPRPLPP 723
                         250       260       270       280       290       300       310
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568926927  501 LPPLGVGNPRTmPPTRDSALTPAGSKKFTGREtSKKTRQKSSPrkPEPLSPGKSARDASPRDLTTKPSRPSTP 573
Cdd:PTZ00449  724 KLPRDEEFPFE-PIGDPDAEQPDDIEFFTPPE-EERTFFHETP--ADTPLPDILAEEFKEEDIHAETGEPDEA 792
PHA03247 PHA03247
large tegument protein UL36; Provisional
245-686 1.36e-03

large tegument protein UL36; Provisional


Pssm-ID: 223021 [Multi-domain]  Cd Length: 3151  Bit Score: 43.77  E-value: 1.36e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  245 PQVGTLFPWDSGPAFALHPEPALLGLgnLTRTPATLGARPVSRAL-AVTLAPAMPTKPLRTVHPdvsehSSSQTPLSPAK 323
Cdd:PHA03247 2690 PTVGSLTSLADPPPPPPTPEPAPHAL--VSATPLPPGPAAARQASpALPAAPAPPAVPAGPATP-----GGPARPARPPT 2762
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  324 QSARKTPSPSSSASLANSTRVYRPAAAQPRQITTTSPTKRSPTKPSVSPLSVTPMKSPHATQKTGVPSFTK--PVPPTQK 401
Cdd:PHA03247 2763 TAGPPAPAPPAAPAAGPPRRLTRPAVASLSESRESLPSPWDPADPPAAVLAPAAALPPAASPAGPLPPPTSaqPTAPPPP 2842
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  402 PAPFTSYLAPSKASSPTvRPVQKtfmtprppvpspqplRPTTGLskkfTNPTVAKSKSKTTSWASKPVLARSSVPKTLQQ 481
Cdd:PHA03247 2843 PGPPPPSLPLGGSVAPG-GDVRR---------------RPPSRS----PAAKPAAPARPPVRRLARPAVSRSTESFALPP 2902
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  482 TVLSQSPVSYLGSQTLAPALPPLgvgNPRTMPPTRDSALTPAGSKKFTGRETSKKTRQKSSPRKPEPLSPGksaRDASPR 561
Cdd:PHA03247 2903 DQPERPPQPQAPPPPQPQPQPPP---PPQPQPPPPPPPRPQPPLAPTTDPAGAGEPSGAVPQPWLGALVPG---RVAVPR 2976
                         330       340       350       360       370       380       390       400
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  562 DLTTKPsRPSTPALVLAPAYLLSSSPQPTSSSFPFFHL-LGPTPfpmlmgPPGSKGDCGLPGPPGLPGLPGSPGARGPRG 640
Cdd:PHA03247 2977 FRVPQP-APSREAPASSTPPLTGHSLSRVSSWASSLALhEETDP------PPVSLKQTLWPPDDTEDSDADSLFDSDSER 3049
                         410       420       430       440
                  ....*....|....*....|....*....|....*....|....*.
gi 568926927  641 PPGPYGNPGPPGPPGAKGQKGDPGLSPGQAHDGAKGNMGLPGLSGN 686
Cdd:PHA03247 3050 SDLEALDPLPPEPHDPFAHEPDPATPEAGARESPSSQFGPPPLSAN 3095
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1385-1439 1.36e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 38.63  E-value: 1.36e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1385 GEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDGIPGRDGRPG 1439
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1349-1403 1.41e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 38.63  E-value: 1.41e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1349 GYPGQEGVQGLRGEPGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRG 1403
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
PRK12678 PRK12678
transcription termination factor Rho; Provisional
1283-1463 1.46e-03

transcription termination factor Rho; Provisional


Pssm-ID: 237171 [Multi-domain]  Cd Length: 672  Bit Score: 43.35  E-value: 1.46e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1283 RMGAQGEPGLAGYNGHKGITGPLGPPGPKGEKGDQGEDGKTEGPPGPPGDRGPVGDRGDRGEPGDPGYPGQEGVQglRGE 1362
Cdd:PRK12678   58 ARGGGAAAAAATPAAPAAAARRAARAAAAARQAEQPAAEAAAAKAEAAPAARAAAAAAAEAASAPEAAQARERRE--RGE 135
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1363 PGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGlpgPRGVVGRQGPEGTAGSDGIPGRDGRPGYQG 1442
Cdd:PRK12678  136 AARRGAARKAGEGGEQPATEARADAAERTEEEERDERRRRGDREDRQ---AEAERGERGRREERGRDGDDRDRRDRREQG 212
                         170       180
                  ....*....|....*....|.
gi 568926927 1443 DQGNDGDPGPVGpaGRRGNPG 1463
Cdd:PRK12678  213 DRREERGRRDGG--DRRGRRR 231
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
266-421 1.50e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 43.41  E-value: 1.50e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   266 ALLGLGNLTRTPATLGAR--PVSRALAVTLAPAMPTkpLRTVHPDVSEHSSSQTPLSPAKQSARKTPSPSSSASLANSTR 343
Cdd:pfam17823  227 ALAAVGNSSPAAGTVTAAvgTVTPAALATLAAAAGT--VASAAGTINMGDPHARRLSPAKHMPSDTMARNPAAPMGAQAQ 304
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   344 --VYRPAAAQPRQITTTSPTKrSPTKPSVSPLSVTPMKSPHATQKTGVPSFTKPVPPTQKPAPFTSYLAPSKASSPTVRP 421
Cdd:pfam17823  305 gpIIQVSTDQPVHNTAGEPTP-SPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVLHTSMIPEVEATSPTTQP 383
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1141-1196 1.52e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 38.24  E-value: 1.52e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 568926927  1141 GEPGLPGDSGMKGDLGPLGPPGEQGLIGQRGEPGLEGDHGPVGPDGLKGDRGDPGP 1196
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
PHA03169 PHA03169
hypothetical protein; Provisional
1338-1481 1.55e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 43.04  E-value: 1.55e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1338 DRGDRGEPGDPGyPGQEGVQGLRGEPGQQGQPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPgprgvv 1417
Cdd:PHA03169   98 ESVGSPTPSPSG-SAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGPHEPAPPESHNPSPNQQPSSFLQPS------ 170
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|....
gi 568926927 1418 GRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAgrrgNPGVAGLPGAQGPPGFKGESG 1481
Cdd:PHA03169  171 HEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPPD----EPGEPQSPTPQQAPSPNTQQA 230
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
751-802 1.55e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 38.24  E-value: 1.55e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 568926927   751 GLPGSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPPG 802
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPG 52
PHA03169 PHA03169
hypothetical protein; Provisional
1313-1452 1.56e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 43.04  E-value: 1.56e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1313 EKGDQGEDGKTEGPPGPPGDRGPVGDRGDRGEPGDPGYPGQEGVQGLRGEPGQQGQPGHPGPrGRPGPKGSKGEEGPKGK 1392
Cdd:PHA03169   85 EERGQGGPSGSGSESVGSPTPSPSGSAEELASGLSPENTSGSSPESPASHSPPPSPPSHPGP-HEPAPPESHNPSPNQQP 163
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568926927 1393 PGKAGPSGRRGTQ---GLQGLPGPRGVVGRQGPEGTAGSDGIPGRDgRPGYQGDQGNDGDPGP 1452
Cdd:PHA03169  164 SSFLQPSHEDSPEepePPTSEPEPDSPGPPQSETPTSSPPPQSPPD-EPGEPQSPTPQQAPSP 225
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
273-572 1.59e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 43.02  E-value: 1.59e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   273 LTRTPATLGARPVSR-ALAVTLAPAMPTKPLRTVHPDVSEHSSSQTPLSPAKQSARKTPSPSSSASLANSTRVYRPAAAQ 351
Cdd:pfam17823  162 IAAASAPHAASPAPRtAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAAVGNSSPAAGTV 241
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   352 PRQITTTSPTKRSPTKPSVSPLSVTP----MKSPHATQktgvPSFTKPVPP---TQKPAPFTSYLAPSKASSPTV-RPVQ 423
Cdd:pfam17823  242 TAAVGTVTPAALATLAAAAGTVASAAgtinMGDPHARR----LSPAKHMPSdtmARNPAAPMGAQAQGPIIQVSTdQPVH 317
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   424 KTfMTPRPPVPSPQPLRPTTGLSKKFTNPTV---AKSKSKTTSWASKPVLARSSVPKtlqqtVLSQSPVSylgsqtlapa 500
Cdd:pfam17823  318 NT-AGEPTPSPSNTTLEPNTPKSVASTNLAVvttTKAQAKEPSASPVPVLHTSMIPE-----VEATSPTT---------- 381
                          250       260       270       280       290       300       310
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568926927   501 lpplgvgNPRTMPPTRDSAltpagskkftGRETSKKTRQKSSPRKPEPLSPGKSARDA-SPRDLTTKPSRPST 572
Cdd:pfam17823  382 -------QPSPLLPTQGAA----------GPGILLAPEQVATEATAGTASAGPTPRSSgDPKTLAMASCQLST 437
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1376-1430 1.73e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 38.24  E-value: 1.73e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1376 GRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDG 1430
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
PHA03169 PHA03169
hypothetical protein; Provisional
1074-1235 1.87e-03

hypothetical protein; Provisional


Pssm-ID: 223003 [Multi-domain]  Cd Length: 413  Bit Score: 42.65  E-value: 1.87e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1074 RGRPGQPGQQGAAGErGHSGAKGFLGIPGPSGPPGAKGLPGEPGSqgpqgpvgPPGEMGPKGPPGAVGEPGLPGDsGMKG 1153
Cdd:PHA03169   81 HGEKEERGQGGPSGS-GSESVGSPTPSPSGSAEELASGLSPENTS--------GSSPESPASHSPPPSPPSHPGP-HEPA 150
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1154 DLGPLGPPGEQGLIGQRGEPGLEGDHGPVGPDGLKGDRGDPGPDGEHGEKGQEGLKG--EDGSPGPPGITGVPGREGKPG 1231
Cdd:PHA03169  151 PPESHNPSPNQQPSSFLQPSHEDSPEEPEPPTSEPEPDSPGPPQSETPTSSPPPQSPpdEPGEPQSPTPQQAPSPNTQQA 230

                  ....
gi 568926927 1232 KQGE 1235
Cdd:PHA03169  231 VEHE 234
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1397-1453 1.89e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 38.24  E-value: 1.89e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 568926927  1397 GPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPV 1453
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Glutenin_hmw pfam03157
High molecular weight glutenin subunit; Members of this family include high molecular weight ...
1053-1520 2.09e-03

High molecular weight glutenin subunit; Members of this family include high molecular weight subunits of glutenin. This group of gluten proteins is thought to be largely responsible for the elastic properties of gluten, and hence, doughs. Indeed, glutenin high molecular weight subunits are classified as elastomeric proteins, because the glutenin network can withstand significant deformations without breaking, and return to the original conformation when the stress is removed. Elastomeric proteins differ considerably in amino acid sequence, but they are all polymers whose subunits consist of elastomeric domains, composed of repeated motifs, and non-elastic domains that mediate cross-linking between the subunits. The elastomeric domain motifs are all rich in glycine residues in addition to other hydrophobic residues. High molecular weight glutenin subunits have an extensive central elastomeric domain, flanked by two terminal non-elastic domains that form disulphide cross-links. The central elastomeric domain is characterized by the following three repeated motifs: PGQGQQ, GYYPTS[P/L]QQ, GQQ. It possesses overlapping beta-turns within and between the repeated motifs, and assumes a regular helical secondary structure with a diameter of approx. 1.9 nm and a pitch of approx. 1.5 nm.


Pssm-ID: 367362 [Multi-domain]  Cd Length: 786  Bit Score: 43.01  E-value: 2.09e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  1053 RGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAAGERGHSGAKGFLGIPGPSGPPGAKGLPGEPGSQGPQGPVGPPGEMG 1132
Cdd:pfam03157  217 QGQQGQQPERGQQGQQPGQGQQPGQGQQGQQPGQPQQLGQGQQGYYPISPQQPRQWQQSGQGQQGYYPTSLQQPGQGQSG 296
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  1133 PKgpPGAVGEPGLPGDSGMKGDLGPLGPPGeQGLIGQRGEPGLEGDHGPVGPDGLKGDRGDPGPDGEHGEKGQEGLKGED 1212
Cdd:pfam03157  297 YY--PTSQQQAGQLQQEQQLGQEQQDQQPG-QGRQGQQPGQGQQGQQPAQGQQPGQGQPGYYPTSPQQPGQGQPGYYPTS 373
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  1213 GSPGPPGITGVPGREGKPGKQGEKGQRGAKG---AKGHQGYLGEMGIPGEPGPPGTPGPKGSRGTLGPTGAPGRMGAQGE 1289
Cdd:pfam03157  374 QQQPQQGQQPEQGQQGQQQGQGQQGQQPGQGqqpGQGQPGYYPTSPQQSGQGQPGYYPTSPQQSGQGQQPGQGQQPGQEQ 453
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  1290 PGLAGYNGHkgitgplgppGPKGEKGDQGEDGKTEGPPGPPGDRGPVGDRGDRGEPGDPGYPGQEGVQGLRGEPGQ--QG 1367
Cdd:pfam03157  454 PGQGQQPGQ----------GQQGQQPGQPEQGQQPGQGQPGYYPTSPQQSGQGQQLGQWQQQGQGQPGYYPTSPLQpgQG 523
                          330       340       350       360       370       380       390       400
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  1368 QPGHPGPRGRPGPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDGIPGRDGRPGYQgdqgnd 1447
Cdd:pfam03157  524 QPGYYPTSPQQPGQGQQLGQLQQPTQGQQGQQSGQGQQGQQPGQGQQGQQPGQGQQGQQPGQGQQPGQGQPGYY------ 597
                          410       420       430       440       450       460       470
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|...
gi 568926927  1448 gdPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQlgppGKRGTeggtgLPGNQGEPGSKGQPGDSGEMG 1520
Cdd:pfam03157  598 --PTSPQQSGQGQQPGQWQQPGQGQPGYYPTSSLQLGQ----GQQGY-----YPTSPQQPGQGQQPGQWQQSG 659
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1379-1433 2.46e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.86  E-value: 2.46e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1379 GPKGSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDGIPG 1433
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
901-975 2.86e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.47  E-value: 2.86e-03
                           10        20        30        40        50        60        70
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*
gi 568926927   901 GPPGDNGPEGMKGKPGARGLPGPPGQLGPegdegpmgppgvpglegqPGRKGFPGRPGLDGSKGEPGDPGRPGPV 975
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGP------------------PGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1496-1550 3.03e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.47  E-value: 3.03e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1496 GGTGLPGNQGEPGSKGQPGDSGEMGFPGVAGLFGPKGPPGDIGFKGIQGPRGPPG 1550
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Laminin_G_2 pfam02210
Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G ...
124-206 3.21e-03

Laminin G domain; This family includes the Thrombospondin N-terminal-like domain, a Laminin G subfamily.


Pssm-ID: 460494 [Multi-domain]  Cd Length: 126  Bit Score: 39.33  E-value: 3.21e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   124 KLQLGLQFLPGRTIIHLGPRQsvafdldVHDGRWHHLALELRGRTVTMVTaCGQHRVPVPLPSRRDsMLDPQGSFLLGKV 203
Cdd:pfam02210   29 RLVLRYDLGSGPESLLSSGKN-------LNDGQWHSVRVERNGNTLTLSV-DGQTVVSSLPPGESL-LLNLNGPLYLGGL 99

                   ...
gi 568926927   204 NPR 206
Cdd:pfam02210  100 PPL 102
SPT5 COG5164
Transcription elongation factor SPT5 [Transcription];
647-834 3.27e-03

Transcription elongation factor SPT5 [Transcription];


Pssm-ID: 444063 [Multi-domain]  Cd Length: 495  Bit Score: 42.32  E-value: 3.27e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  647 NPGPPGPPGAKGQKGDPGLSPGQAHDGAKGNMGLPGLSGNPGPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGAKGYPGRQ 726
Cdd:COG5164    35 STRPAGNTGGTRPAQNQGSTTPAGNTGGTRPAGNQGATGPAQNQGGTTPAQNQGGTRPAGNTGGTTPAGDGGATGPPDDG 114
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  727 GFPGPVGDPGPKG--SRGYIGLPGLFG-LPGSDGERGLPGVPGKRGEMGRPGFPGDFGERGPPGLDG-----NPGEIGLP 798
Cdd:COG5164   115 GATGPPDDGGSTTppSGGSTTPPGDGGsTPPGPGSTGPGGSTTPPGDGGSTTPPGPGGSTTPPDDGGsttppNKGETGTD 194
                         170       180       190
                  ....*....|....*....|....*....|....*.
gi 568926927  799 GPPGVLGLIGDTGALGPVGYPGPKGMKGLMGGVGEP 834
Cdd:COG5164   195 IPTGGTPRQGPDGPVKKDDKNGKGNPPDDRGGKTGP 230
PRK07764 PRK07764
DNA polymerase III subunits gamma and tau; Validated
1343-1535 3.39e-03

DNA polymerase III subunits gamma and tau; Validated


Pssm-ID: 236090 [Multi-domain]  Cd Length: 824  Bit Score: 42.28  E-value: 3.39e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1343 GEPGDPGYPGQEGVQGlRGEPGQQGQPGHPG-PRGRPGPKGSkgeegpkgkPGKAGPSGRRGTQGLQGLPGPRGVVGRQG 1421
Cdd:PRK07764  590 PAPGAAGGEGPPAPAS-SGPPEEAARPAAPAaPAAPAAPAPA---------GAAAAPAEASAAPAPGVAAPEHHPKHVAV 659
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927 1422 PEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLPGAQGPPGFKGESGLPGQLGPPGKRGTEGGTGLP 1501
Cdd:PRK07764  660 PDASDGGDGWPAKAGGAAPAAPPPAPAPAAPAAPAGAAPAQPAPAPAATPPAGQADDPAAQPPQAAQGASAPSPAADDPV 739
                         170       180       190
                  ....*....|....*....|....*....|....
gi 568926927 1502 GNQGEPGSKGQPGDSGEMGfPGVAGLFGPKGPPG 1535
Cdd:PRK07764  740 PLPPEPDDPPDPAGAPAQP-PPPPAPAPAAAPAA 772
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1189-1244 3.69e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 37.47  E-value: 3.69e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*.
gi 568926927  1189 GDRGDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGA 1244
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGP 56
PRK14971 PRK14971
DNA polymerase III subunit gamma/tau;
261-400 4.08e-03

DNA polymerase III subunit gamma/tau;


Pssm-ID: 237874 [Multi-domain]  Cd Length: 614  Bit Score: 42.07  E-value: 4.08e-03
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927  261 LHPEPALLGLGNLTRT--PATLGARPVSRALAVTLAPAMPTKPLRTVHPDVSEHSSSQTPLSPAKQSARKTpspsssaSL 338
Cdd:PRK14971  350 LLVELTLIQLAQLTQKgdDASGGRGPKQHIKPVFTQPAAAPQPSAAAAASPSPSQSSAAAQPSAPQSATQP-------AG 422
                          90       100       110       120       130       140
                  ....*....|....*....|....*....|....*....|....*....|....*....|..
gi 568926927  339 ANSTRVYRPAAAQPRQITTTSPTKRSPTkPSVSPLSVTPMKSPHATqktgvPSFTKPVPPTQ 400
Cdd:PRK14971  423 TPPTVSVDPPAAVPVNPPSTAPQAVRPA-QFKEEKKIPVSKVSSLG-----PSTLRPIQEKA 478
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
763-817 5.86e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.70  E-value: 5.86e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927   763 GVPGKRGEMGRPGFPGDFGERGPPGLDGNPGEIGLPGPPGVLGLIGDTGALGPVG 817
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1192-1246 6.04e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.70  E-value: 6.04e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*
gi 568926927  1192 GDPGPDGEHGEKGQEGLKGEDGSPGPPGITGVPGREGKPGKQGEKGQRGAKGAKG 1246
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1382-1438 6.53e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.70  E-value: 6.53e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 568926927  1382 GSKGEEGPKGKPGKAGPSGRRGTQGLQGLPGPRGVVGRQGPEGTAGSDGIPGRDGRP 1438
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
673-724 6.86e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.70  E-value: 6.86e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|..
gi 568926927   673 GAKGNMGLPGLSGNPGPLGRKGHKGHPGAAGHPGEQGQPGPEGSPGAKGYPG 724
Cdd:pfam01391    4 GPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPG 55
DUF5585 pfam17823
Family of unknown function (DUF5585); This is a family of unknown function found in chordata.
271-613 6.97e-03

Family of unknown function (DUF5585); This is a family of unknown function found in chordata.


Pssm-ID: 465521 [Multi-domain]  Cd Length: 506  Bit Score: 41.10  E-value: 6.97e-03
                           10        20        30        40        50        60        70        80
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   271 GNLTRTPATLGARPVSralAVTLAPAMPTkplrtvhpdvsehssSQTPLSPAKQSARKTPSPSSSASLANSTRVYR-PAA 349
Cdd:pfam17823   94 GTDLSEPATREGAADG---AASRALAAAA---------------SSSPSSAAQSLPAAIAALPSEAFSAPRAAACRaNAS 155
                           90       100       110       120       130       140       150       160
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   350 AQPRQITTT-------SPTKRSPTKPSVSPLSVTPMKSPHATQKTGVPSFTKPVPPTQKPAPFTSYLAPSKASS--PTVR 420
Cdd:pfam17823  156 AAPRAAIAAasaphaaSPAPRTAASSTTAASSTTAASSAPTTAASSAPATLTPARGISTAATATGHPAAGTALAavGNSS 235
                          170       180       190       200       210       220       230       240
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   421 PVQKTfmtprppvpspqpLRPTTGLSKKFTNPTVAKSKSKTTSWA-----SKPVLARSSVPKTLQQTVLSQSPVSYLGSQ 495
Cdd:pfam17823  236 PAAGT-------------VTAAVGTVTPAALATLAAAAGTVASAAgtinmGDPHARRLSPAKHMPSDTMARNPAAPMGAQ 302
                          250       260       270       280       290       300       310       320
                   ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 568926927   496 TLAPalpplgvgnprTMPPTRDSALTPAgskkfTGRETSKKTRQKSSPRKPEPLSPGKSARDASPRDLTTKPSRPSTPAL 575
Cdd:pfam17823  303 AQGP-----------IIQVSTDQPVHNT-----AGEPTPSPSNTTLEPNTPKSVASTNLAVVTTTKAQAKEPSASPVPVL 366
                          330       340       350       360
                   ....*....|....*....|....*....|....*....|....
gi 568926927   576 vlapayllssspqpTSSSFPFFHLLGPT--PFPML----MGPPG 613
Cdd:pfam17823  367 --------------HTSMIPEVEATSPTtqPSPLLptqgAAGPG 396
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1132-1182 7.96e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.32  E-value: 7.96e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|.
gi 568926927  1132 GPKGPPGAVGEPGLPGDSGMKGDLGPLGPPGEQGLIGQRGEPGLEGDHGPV 1182
Cdd:pfam01391    7 GPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1038-1086 8.11e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.32  E-value: 8.11e-03
                           10        20        30        40
                   ....*....|....*....|....*....|....*....|....*....
gi 568926927  1038 RGLPGMRGAKGHRGPRGPDGPAGEQGSKGLKGRVGPRGRPGQPGQQGAA 1086
Cdd:pfam01391    9 PGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1406-1462 8.61e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.32  E-value: 8.61e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 568926927  1406 GLQGLPGPRGVVGRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNP 1462
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
Collagen pfam01391
Collagen triple helix repeat (20 copies); Members of this family belong to the collagen ...
1412-1468 9.13e-03

Collagen triple helix repeat (20 copies); Members of this family belong to the collagen superfamily. Collagens are generally extracellular structural proteins involved in formation of connective tissue structure. The alignment contains 20 copies of the G-X-Y repeat that forms a triple helix. The first position of the repeat is glycine, the second and third positions can be any residue but are frequently proline and hydroxy-proline. Collagens are post translationally modified by proline hydroxylase to form the hydroxy-proline residues. Defective hydroxylation is the cause of scurvy. Some members of the collagen superfamily are not involved in connective tissue structure but share the same triple helical structure. The family includes bacterial collagen-like triple-helix repeat proteins.


Pssm-ID: 460189 [Multi-domain]  Cd Length: 57  Bit Score: 36.32  E-value: 9.13e-03
                           10        20        30        40        50
                   ....*....|....*....|....*....|....*....|....*....|....*..
gi 568926927  1412 GPRGVVGRQGPEGTAGSDGIPGRDGRPGYQGDQGNDGDPGPVGPAGRRGNPGVAGLP 1468
Cdd:pfam01391    1 GPPGPPGPPGPPGPPGPPGPPGPPGPPGPPGEPGPPGPPGPPGPPGPPGAPGAPGPP 57
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH