Protein

Protein accession: 1p1Nk [EnVhog]
Representative: 6RoCS
Source: EnVhog (cluster: phalp2_6291)
Protein name: 1p1Nk
Lysin probability: 99%
PhaLP type: endolysin (probability: 89%, predicted by ML model)
Protein sequence
MLPIDTTTYAFTKNYRKRTQPIKRIILHWDVGVDVTGDKIIDAADTYKILLNRGLSTHFTVDNDGTIYQMLNPLLVAYHAGMWNGTSIGVDICNIVQPINDKTKKHYKDNWGDRPIIASMKLNGAKYTNFFGFYDKQISQTLRLIDYLCDTYNIKKIIPSKLYETCISSLEWEGVCGHYHVEKKKWDPLSFPFEKIKIEPPQATRQS
Physico-chemical properties
Protein length: 207 AA
Molecular weight: 23928,3 Da
Isoelectric point: 8,88
Hydropathy: -0,40
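The page does not state how the hydropathy value is derived. A common convention is the GRAVY score, the mean Kyte-Doolittle hydropathy over all residues. The sketch below assumes that convention and recomputes the length and a GRAVY value for 1p1Nk; the names `KD`, `SEQ`, and `gravy` are illustrative and not part of the database.

```python
# Kyte-Doolittle hydropathy scale (one value per standard amino acid).
KD = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 1p1Nk as listed on this page, split into chunks for readability.
SEQ = (
    "MLPIDTTTYAFTKNYRKRTQPIKRIILHWDVGVDVTGDKIIDAADTYKIL"
    "LNRGLSTHFTVDNDGTIYQMLNPLLVAYHAGMWNGTSIGVDICNIVQPIN"
    "DKTKKHYKDNWGDRPIIASMKLNGAKYTNFFGFYDKQISQTLRLIDYLCD"
    "TYNIKKIIPSKLYETCISSLEWEGVCGHYHVEKKKWDPLSFPFEKIKIEP"
    "PQATRQS"
)


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    if not sequence:
        raise ValueError("sequence must not be empty")
    return sum(KD[aa] for aa in sequence) / len(sequence)


print(f"length: {len(SEQ)} AA")
print(f"GRAVY (assumed definition of hydropathy): {gravy(SEQ):.2f}")
```

A negative score indicates a net hydrophilic sequence. If the database uses a different scale or window, the recomputed value may differ from the one reported above.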
Representative Protein Details
Accession: 6RoCS
Protein name: 6RoCS
Sequence length: 238 AA
Molecular weight: 26135,55100 Da
Isoelectric point: 8,63588
Sequence
MFMSDLKPKANIIGPERAAVLGTDKLIVASAEYAASGFNLINFNDQDGFSFHKNPALFRNRTQKIDRLVLHWDGSRSSKGCFNALLARGLSVHLMLDRDGTVYQALDLYSAVASHASGANGRSVGIEICNIVDVREAHKEPDRIITQSFFPSGWKPKHLDFTEAQKKALVPLVQLICGTVGIPPRIPDEKNLPKNGFVSDGWSGVCGHYHIPNKNGKWDPGTTLWPVLKQAGFVEEGI
Other Proteins in cluster: phalp2_6291
Total (incl. this protein): 2
Avg length: 222,5
Avg pI: 8,76

Protein ID   Length (AA)   pI
6RoCS        238           8,63588
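The cluster averages can be checked against the two member records. The sketch below assumes the averages are plain arithmetic means over both members (this protein and 6RoCS); `parse_decimal` is a hypothetical helper for the decimal-comma notation used on this page.

```python
from statistics import mean


def parse_decimal(text: str) -> float:
    """Parse a number written with a decimal comma, e.g. '8,63588'."""
    return float(text.replace(",", "."))


# Length (AA) and isoelectric point of both cluster members, as listed on this page.
members = {
    "1p1Nk": {"length": 207, "pI": parse_decimal("8,88")},
    "6RoCS": {"length": 238, "pI": parse_decimal("8,63588")},
}

avg_length = mean(m["length"] for m in members.values())
avg_pi = mean(m["pI"] for m in members.values())

print(f"Avg length: {avg_length:.1f}")  # 222.5
print(f"Avg pI: {avg_pi:.2f}")          # 8.76
```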
Similar Clusters (pHMM search)
#    Cluster                 # Members   Identity (%)   Alignment Length   E-value
1    phalp2_30132 (3gxUn)    18          38,1%          199                1.870E-40
2    phalp2_13009 (40SH2)    16          38,6%          163                2.658E-30
3    phalp2_9065 (5sRvP)     49          36,3%          187                1.190E-24
4    phalp2_570 (PDRe)       48          32,2%          223                1.392E-17
5    phalp2_40230 (2zpAj)    1           27,5%          258                1.592E-16
6    phalp2_22012 (59QfH)    56          28,7%          188                2.038E-14
7    phalp2_30246 (4fbkp)    14          30,1%          222                1.247E-13
8    phalp2_26666 (2qS81)    37          27,5%          207                1.870E-12
9    phalp2_13010 (40VUd)    2           24,5%          232                6.208E-12
10   phalp2_34043 (800Zh)    5           25,6%          179                2.772E-11

Domains

Annotations: Disordered region, Ami2
Pfam accessions: PF01510
Representative sequence (used for alignment): 6RoCS (238 AA)
Member sequence: 1p1Nk (207 AA)
Domain positions follow the representative sequence (residues 1 to 238); the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

        Name                      Taxonomy ID      Lineage
Phage   Unknown from Metagenome   UNKNOWN_ENVHOG   No lineage information
Host    No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 1p1Nk
Method: AlphaFoldv2
Resolution: 91.18
Chain position: -
Model confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
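The confidence bands can be expressed as a small lookup. The sketch below assumes the number listed under "Resolution" for an AlphaFold model is a mean pLDDT score, which the page does not state explicitly. The legend also leaves the exact boundary values (90, 70, 50) unassigned; here each boundary value falls into the lower band, which is an arbitrary choice. `plddt_band` is a hypothetical helper name.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band used in the legend."""
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must lie between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


# Values listed on this page for the member and the cluster representative.
for accession, score in {"1p1Nk": 91.18, "6RoCS": 90.59}.items():
    print(accession, plddt_band(score))
```

Under these assumptions both models fall into the "Very high" band.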

The structures below correspond to the cluster representative (6RoCS) rather than this protein.
PDB ID: 6RoCS
Method: AlphaFoldv2
Resolution: 90.59
Chain position: -