Protein

Protein accession
AGsn [EnVhog]
Representative
2STSP
Source
EnVhog (cluster: phalp2_17143)
Protein name
AGsn
Lysin probability
98%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
KYGSGSDGQESGDPEGSFPAANCRTGDATKDAIINALYSAGLKTPNALAGALGNLQAESGFDHNVHNTSRQGVTCVTVSGTPEKCYGLVQWGGSRKTEIIAKCGQTSTLQCQLEFMVQEVKKRGGGVVEGMNSASSASAAAEIWRKKYEVGSGGIAERQQYAEQIVKTITCNS
Physico‐chemical
properties
protein length:173 AA
molecular weight:18084,9 Da
isoelectric point:6,90
hydropathy:-0,43
Representative Protein Details
Accession
2STSP
Protein name
2STSP
Sequence length
172 AA
Molecular weight
18363,47720 Da
Isoelectric point
6,11556
Sequence
MAAPTWVAVLALLFGMATMDSSFAARRDAAMAFFASQGWTHNQAAGIVGNLQAESGIDHTRAQMNGGPGYGLAQWEGPRQAMFKTWAGRDIRESTFDEQLRFIQFELTGPEIGAGRALKAAKSAGAAGAIVCRLYERPADTDGQAVYRCGLAEKIARETQFNNVIAGSESTA
Other Proteins in cluster: phalp2_17143
Total (incl. this protein): 4 Avg length: 174,3 Avg pI: 7,07

Protein ID Length (AA) pI
2STSP 172 6,11556
5BBKy 178 9,77703
XZgW 174 5,49510
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_1063
7aR4F
3 44,6% 150 2.548E-28
2 phalp2_16088
4aMjC
1 51,3% 111 1.100E-26
3 phalp2_35267
6RVpS
9 42,6% 136 7.946E-24
4 phalp2_30734
7sLSx
1 39,6% 126 2.923E-18
5 phalp2_2583
6CHLx
1 33,6% 107 1.228E-16
6 phalp2_18657
8z6mC
13 36,4% 107 5.819E-16
7 phalp2_27338
4vg8B
32 35,6% 143 8.355E-14
8 phalp2_5269
8e5oF
72 33,0% 118 2.884E-13
9 phalp2_17643
6M3yO
1 40,5% 116 1.354E-12
10 phalp2_834
7WW8c
6 33,6% 119 1.845E-12

Domains

Domains
Disordered region
Phage_lys2
Representative sequence (used for alignment): 2STSP (172 AA)
Member sequence: AGsn (173 AA)
1 172 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF18013

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (2STSP) rather than this protein.
PDB ID
2STSP
Method AlphaFoldv2
Resolution 88.63
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50