Protein

Protein accession
5nNqk [EnVhog]
Representative
7csSp
Source
EnVhog (cluster: phalp2_13823)
Protein name
5nNqk
Lysin probability
93%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MKKLVTSLAIASSLLMTSPAFADYTVQSGDTLSQIAENEHITLSDLIRLNPQISNPNLIYVNDVIKTQDVANPPTEVRATYSATDLDLLSRLVESEAGGEPYAGKIAVAETVMNRTVSGYFPKTIEGVIYEPGQFQPVANGWINKPASDDSIRAAKQVLATYAPDPNGSLYFYAPSKSSDSWIRTRQVVEVIGKQVFCK
Physico-chemical properties
protein length: 199 AA
molecular weight: 21675,3 Da
isoelectric point: 5,04
hydropathy: -0,15
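The hydropathy value above is presumably a GRAVY score (grand average of hydropathy), that is, the mean Kyte-Doolittle index over all residues. The page does not name the scale, so that choice is an assumption, and `gravy` is a hypothetical helper name. A minimal sketch of the calculation:

```python
# Kyte-Doolittle hydropathy index per residue.
# Assumed scale: the page does not say which one it uses.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy of a protein sequence (hypothetical helper)."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # A non-standard residue code (e.g. X) raises KeyError here.
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    fragment = "MKKLVTSLAI"  # first 10 residues of 5nNqk
    print(len(fragment), round(gravy(fragment), 2))
```

Passing the full protein sequence instead of the fragment gives a value that can be compared with the hydropathy listed above, provided the page really uses this scale.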
Representative Protein Details
Accession
7csSp
Protein name
7csSp
Sequence length
164 AA
Molecular weight
17891,54440 Da
Isoelectric point
4,80530
Sequence
VLAELNPQIEDIDLIYPDQEVKTEQSGKAAVPGSDEKANVTAGQEKETFRNHELLAQLVEAEAKGETFKGKVAVAEVVLNRVEHSTFPDSVEAVIYQEGQFSPVSNGSINKPASEESKRAVHEAMGSEDITQGSLFFHNPAESNSSYMHAKQPIVIIGNHEFSK
Other Proteins in cluster: phalp2_13823
Total (incl. this protein): 6; Avg length: 199,2; Avg pI: 4,84

Protein ID Length (AA) pI
7csSp 164 4,80530
3Y97W 221 4,50338
4LgVd 201 5,93310
7rSrr 205 4,09163
7wh8n 205 4,64644
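The cluster averages can be cross-checked from the member values. The sketch below assumes the averages include this protein (5nNqk, 199 AA, pI 5,04), as the "incl. this protein" total suggests; `parse_decimal` and `cluster_averages` are hypothetical helper names, and the decimal commas used on the page are converted before arithmetic.

```python
def parse_decimal(text: str) -> float:
    """Convert a decimal-comma number such as '4,80530' to a float."""
    return float(text.replace(",", "."))


# (protein ID, length in AA, pI) as listed on the page; 5nNqk is this protein.
MEMBERS = [
    ("5nNqk", 199, "5,04"),
    ("7csSp", 164, "4,80530"),
    ("3Y97W", 221, "4,50338"),
    ("4LgVd", 201, "5,93310"),
    ("7rSrr", 205, "4,09163"),
    ("7wh8n", 205, "4,64644"),
]


def cluster_averages(members):
    """Return (mean length, mean pI) over all cluster members."""
    n = len(members)
    avg_length = sum(length for _, length, _ in members) / n
    avg_pi = sum(parse_decimal(pi) for _, _, pi in members) / n
    return avg_length, avg_pi


if __name__ == "__main__":
    avg_length, avg_pi = cluster_averages(MEMBERS)
    print(f"{avg_length:.1f} {avg_pi:.2f}")  # prints: 199.2 4.84
```

Both results agree with the averages reported for the cluster when rounded to the precision shown.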
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_7898 (8IIiG) 19 42,4% 172 2.287E-46
2 phalp2_34576 (4ZO4j) 9 48,2% 112 7.854E-36
3 phalp2_807 (3x3gW) 4 34,7% 141 1.209E-33
4 phalp2_10779 (3PvFf) 4 38,7% 147 7.326E-26
5 phalp2_3540 (4yvcs) 35 36,4% 137 2.080E-23
6 phalp2_17705 (71EMV) 1 35,1% 145 1.365E-22
7 phalp2_26540 (88MeS) 20 40,7% 113 5.860E-21
8 phalp2_1335 (14NLd) 89 39,5% 124 8.753E-19
9 phalp2_15462 (87Yrl) 67 33,3% 132 1.196E-18
10 phalp2_19006 (135l5) 10 34,7% 115 1.635E-18
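For downstream use, the pHMM hits can be held as records and filtered. The sketch below transcribes the ten hits listed above, with decimal commas converted to points. The `Hit` type, the `strong_hits` helper, and the thresholds (identity of at least 40% and E-value of at most 1e-20) are illustrative assumptions, not cutoffs used by the database.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    cluster: str
    members: int
    identity: float  # percent identity
    aln_length: int  # alignment length
    evalue: float


# Values transcribed from the Similar Clusters table.
HITS = [
    Hit("phalp2_7898", 19, 42.4, 172, 2.287e-46),
    Hit("phalp2_34576", 9, 48.2, 112, 7.854e-36),
    Hit("phalp2_807", 4, 34.7, 141, 1.209e-33),
    Hit("phalp2_10779", 4, 38.7, 147, 7.326e-26),
    Hit("phalp2_3540", 35, 36.4, 137, 2.080e-23),
    Hit("phalp2_17705", 1, 35.1, 145, 1.365e-22),
    Hit("phalp2_26540", 20, 40.7, 113, 5.860e-21),
    Hit("phalp2_1335", 89, 39.5, 124, 8.753e-19),
    Hit("phalp2_15462", 67, 33.3, 132, 1.196e-18),
    Hit("phalp2_19006", 10, 34.7, 115, 1.635e-18),
]


def strong_hits(hits, min_identity=40.0, max_evalue=1e-20):
    """Keep hits meeting both (illustrative) thresholds, most significant first."""
    kept = [h for h in hits if h.identity >= min_identity and h.evalue <= max_evalue]
    return sorted(kept, key=lambda h: h.evalue)


if __name__ == "__main__":
    for hit in strong_hits(HITS):
        print(hit.cluster, hit.identity, hit.evalue)
```

With these thresholds, three clusters remain: phalp2_7898, phalp2_34576, and phalp2_26540.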

Domains

Representative sequence (used for alignment): 7csSp (164 AA)
Member sequence: 5nNqk (199 AA)
[Domain architecture figure: axis spans residues 1 to 164 of the representative sequence.]
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF07486

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG [NCBI]
Lineage: No lineage information
Host: No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (7csSp) rather than this protein.
PDB ID: 7csSp
Method: AlphaFold v2
Resolution: 82.68
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
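The model confidence bands above map a pLDDT score to a label. A minimal sketch of that mapping follows; `plddt_band` is a hypothetical helper name. The page gives only strict inequalities, so where the exact boundary values (90, 70, 50) fall is an assumption: here each is placed in the lower band.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence label used in the legend."""
    if not 0.0 <= plddt <= 100.0:
        raise ValueError(f"pLDDT out of range: {plddt}")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"  # assumption: exactly 90 falls in this band
    if plddt > 50:
        return "Low"  # assumption: exactly 70 falls in this band
    return "Very low"  # assumption: exactly 50 falls in this band


if __name__ == "__main__":
    for score in (95.0, 82.68, 60.0, 40.0):
        print(score, plddt_band(score))
```

If the 82.68 reported under Resolution for the AlphaFold model is in fact a mean pLDDT, which the page does not state, it would fall in the "High" band.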