Protein

Protein accession
3gnSR [EnVhog]
Representative
2SCDc
Source
EnVhog (cluster: phalp2_21638)
Protein name
3gnSR
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
VIVKPPIEMRADWGALPPKSNPGGFTALNKTVIHYTAANKGYPRSNHSDCREQVRSIQRQHQNIPEQSDIEYNALACNHGTLFEGRVKGYKGGANGSAETNKTMPSICCLLGVGDEPTYEMLNAVAWFHMQVEKAASTSWLEAIGHCDIYATSCPGDPMYALIEVDFIHAMADQPTPTPPGDDDMPAPDIVQINEPWGPFPTGAVFICSPDCMTFRWVKSEDELWQLQQTFPRRGWTFPNPIPPIPSSWLAQYGELIGATP
Physico‐chemical
properties
protein length:261 AA
molecular weight:28831,3 Da
isoelectric point:5,01
hydropathy:-0,38
Representative Protein Details
Accession
2SCDc
Protein name
2SCDc
Sequence length
287 AA
Molecular weight
31190,51800 Da
Isoelectric point
5,37807
Sequence
MPWPQPPIEYRADWGAIPPKSNPGGFTDLVATVCHYTAANRGYMVPESADHERCRSQVRSIQRQHQSISNQSDIEYNALACSHGCLFEGRVLGYKGGANGSADSNKTMPSVCCLVGVDDVPTDAMLAAVGWFHQRVEERAGRTLDMKKHKEITSTSCPGVLLSTWVDNDKFHDSPQPEPEPPQPQPHPPQPTPPGGDMARIGPYLIQATGKDGTPNGRVYATDGNFMTLRWLETTEALEGYRWQLTQYGCSAPELAPGAPIEPIDTISAFGVVISDEGGTKSGKKND
Other Proteins in cluster: phalp2_21638
Total (incl. this protein): 2 Avg length: 274,0 Avg pI: 5,19

Protein ID Length (AA) pI
2SCDc 287 5,37807
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_36432
11r0n
207 37,9% 187 7.197E-29
2 phalp2_11750
42dFf
4 35,4% 223 1.741E-24
3 phalp2_39895
1eEek
97 34,8% 192 6.704E-23
4 phalp2_13951
8FOsh
28 34,4% 186 1.392E-21
5 phalp2_20947
7lyTA
33 31,5% 263 1.173E-17
6 phalp2_23216
7DAyW
82 32,2% 177 2.332E-16
7 phalp2_15057
7s7pP
172 30,3% 224 5.703E-16
8 phalp2_40673
5klYy
3 28,1% 181 2.881E-13
9 phalp2_39086
2ckB5
17 31,6% 193 5.455E-12
10 phalp2_5539
3EPBj
12 29,3% 194 9.803E-12

Domains

Domains
Unannotated
Representative sequence (used for alignment): 2SCDc (287 AA)
Member sequence: 3gnSR (261 AA)
1 287 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
3gnSR
Method AlphaFoldv2
Resolution 74.70
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (2SCDc) rather than this protein.
PDB ID
2SCDc
Method AlphaFoldv2
Resolution 69.67
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50