NCBI Home Page NCBI Site Search page NCBI Guide that lists and describes the NCBI resources
Conserved domains on  [gi|20198152|gb|AAD23615|]
View 

expressed protein [Arabidopsis thaliana]

Protein Classification

Graphical summary

 Zoom to residue level

show extra options »

Show site features     Horizontal zoom: ×

List of domain hits

Name Accession Description Interval E-value
DUF4057 super family cl16198
Protein of unknown function (DUF4057); This family of proteins is functionally uncharacterized. ...
7-326 3.06e-107

Protein of unknown function (DUF4057); This family of proteins is functionally uncharacterized. This family of proteins is found in eukaryotes. Proteins in this family are typically between 279 and 322 amino acids in length.


The actual alignment was detected with superfamily member pfam13266:

Pssm-ID: 404197  Cd Length: 299  Bit Score: 314.79  E-value: 3.06e-107
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20198152     7 NPHHSTADLLSWSEIRRPDYSTA----ANRSNQPSDGMNDVLGGGgQITNAETKSLNTnvshRKNCSGHKLKEMTGSDIF 82
Cdd:pfam13266   7 KPHTSTADLLTWSETPPPDSPAAsapsAARSHQPSDGISKVVFGG-QVTDEEAESLNK----RKPCSGYKLKEMTGSGIF 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20198152    83 SDDGKYDPnhqtrihyhqdqlsqisfsGEENATTPMNGKDDPNHQTRIhyhqDQRSQISFSGEENVTPKKPTTLNEAAKQ 162
Cdd:pfam13266  82 AANGEDDA-------------------SESGSANPNNKTSVRMYQQAV----NGISQISFSEEESVSPKKPTSLPEVAKQ 138
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20198152   163 KELSRTVETQADSKCKKkQISNTKNKAMSGHDIFA-SPESQPRRLfgGATQSEVKGNKNTEESAPRSSRASVKTSN--GQ 239
Cdd:pfam13266 139 RELSGTLESESDSKLKK-QISDAKSKELSGHDIFApPPEIPPRPL--AARNLELKESKDMGEPAPRNVRTSVKVSNpaGG 215
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20198152   240 SSNRLFSEEHVVKSSKKIHNQKsqFQGLTSNGIFKSDkIPPGYSEKMQSSAKKREMSGHNIFADGKSEYRDYYGGARRPP 319
Cdd:pfam13266 216 QSNILFGEEPVVKTAKKIHNQK--FAELTGNDIFKGD-APPGSAEKPLSTAKLKEMSGSDIFADGKAESRDYLGGVRKPP 292

                  ....*..
gi 20198152   320 GGESSIS 326
Cdd:pfam13266 293 GGESSIA 299
 
Name Accession Description Interval E-value
DUF4057 pfam13266
Protein of unknown function (DUF4057); This family of proteins is functionally uncharacterized. ...
7-326 3.06e-107

Protein of unknown function (DUF4057); This family of proteins is functionally uncharacterized. This family of proteins is found in eukaryotes. Proteins in this family are typically between 279 and 322 amino acids in length.


Pssm-ID: 404197  Cd Length: 299  Bit Score: 314.79  E-value: 3.06e-107
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20198152     7 NPHHSTADLLSWSEIRRPDYSTA----ANRSNQPSDGMNDVLGGGgQITNAETKSLNTnvshRKNCSGHKLKEMTGSDIF 82
Cdd:pfam13266   7 KPHTSTADLLTWSETPPPDSPAAsapsAARSHQPSDGISKVVFGG-QVTDEEAESLNK----RKPCSGYKLKEMTGSGIF 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20198152    83 SDDGKYDPnhqtrihyhqdqlsqisfsGEENATTPMNGKDDPNHQTRIhyhqDQRSQISFSGEENVTPKKPTTLNEAAKQ 162
Cdd:pfam13266  82 AANGEDDA-------------------SESGSANPNNKTSVRMYQQAV----NGISQISFSEEESVSPKKPTSLPEVAKQ 138
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20198152   163 KELSRTVETQADSKCKKkQISNTKNKAMSGHDIFA-SPESQPRRLfgGATQSEVKGNKNTEESAPRSSRASVKTSN--GQ 239
Cdd:pfam13266 139 RELSGTLESESDSKLKK-QISDAKSKELSGHDIFApPPEIPPRPL--AARNLELKESKDMGEPAPRNVRTSVKVSNpaGG 215
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20198152   240 SSNRLFSEEHVVKSSKKIHNQKsqFQGLTSNGIFKSDkIPPGYSEKMQSSAKKREMSGHNIFADGKSEYRDYYGGARRPP 319
Cdd:pfam13266 216 QSNILFGEEPVVKTAKKIHNQK--FAELTGNDIFKGD-APPGSAEKPLSTAKLKEMSGSDIFADGKAESRDYLGGVRKPP 292

                  ....*..
gi 20198152   320 GGESSIS 326
Cdd:pfam13266 293 GGESSIA 299
 
Name Accession Description Interval E-value
DUF4057 pfam13266
Protein of unknown function (DUF4057); This family of proteins is functionally uncharacterized. ...
7-326 3.06e-107

Protein of unknown function (DUF4057); This family of proteins is functionally uncharacterized. This family of proteins is found in eukaryotes. Proteins in this family are typically between 279 and 322 amino acids in length.


Pssm-ID: 404197  Cd Length: 299  Bit Score: 314.79  E-value: 3.06e-107
                          10        20        30        40        50        60        70        80
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20198152     7 NPHHSTADLLSWSEIRRPDYSTA----ANRSNQPSDGMNDVLGGGgQITNAETKSLNTnvshRKNCSGHKLKEMTGSDIF 82
Cdd:pfam13266   7 KPHTSTADLLTWSETPPPDSPAAsapsAARSHQPSDGISKVVFGG-QVTDEEAESLNK----RKPCSGYKLKEMTGSGIF 81
                          90       100       110       120       130       140       150       160
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20198152    83 SDDGKYDPnhqtrihyhqdqlsqisfsGEENATTPMNGKDDPNHQTRIhyhqDQRSQISFSGEENVTPKKPTTLNEAAKQ 162
Cdd:pfam13266  82 AANGEDDA-------------------SESGSANPNNKTSVRMYQQAV----NGISQISFSEEESVSPKKPTSLPEVAKQ 138
                         170       180       190       200       210       220       230       240
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20198152   163 KELSRTVETQADSKCKKkQISNTKNKAMSGHDIFA-SPESQPRRLfgGATQSEVKGNKNTEESAPRSSRASVKTSN--GQ 239
Cdd:pfam13266 139 RELSGTLESESDSKLKK-QISDAKSKELSGHDIFApPPEIPPRPL--AARNLELKESKDMGEPAPRNVRTSVKVSNpaGG 215
                         250       260       270       280       290       300       310       320
                  ....*....|....*....|....*....|....*....|....*....|....*....|....*....|....*....|
gi 20198152   240 SSNRLFSEEHVVKSSKKIHNQKsqFQGLTSNGIFKSDkIPPGYSEKMQSSAKKREMSGHNIFADGKSEYRDYYGGARRPP 319
Cdd:pfam13266 216 QSNILFGEEPVVKTAKKIHNQK--FAELTGNDIFKGD-APPGSAEKPLSTAKLKEMSGSDIFADGKAESRDYLGGVRKPP 292

                  ....*..
gi 20198152   320 GGESSIS 326
Cdd:pfam13266 293 GGESSIA 299
 
Blast search parameters
Data Source: Precalculated data, version = cdd.v.3.21
Preset Options:Database: CDSEARCH/cdd   Low complexity filter: no  Composition Based Adjustment: yes   E-value threshold: 0.01

References:

  • Wang J et al. (2023), "The conserved domain database in 2023", Nucleic Acids Res.51(D)384-8.
  • Lu S et al. (2020), "The conserved domain database in 2020", Nucleic Acids Res.48(D)265-8.
  • Marchler-Bauer A et al. (2017), "CDD/SPARCLE: functional classification of proteins via subfamily domain architectures.", Nucleic Acids Res.45(D)200-3.
Help | Disclaimer | Write to the Help Desk
NCBI | NLM | NIH