Protein

Protein accession: 3np9y [EnVhog]
Representative: 4H0aV
Source: EnVhog (cluster: phalp2_20604)
Protein name: 3np9y
Lysin probability: 99%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence
MKPLILILAFLCIAASPRDVTDTLIRMDRLLGDTEALGAVPTLKQDGRVTSGFGRPRPGHKHAGLDIVNAPGTPIRAVGGGTVTHSGWEPGGYGINIVVHYGNLDVLYGHLRLAYVRVGDKVKRGQVIGEMGATGRCSPVGATHLHLEYRINGVPCDPMRFIIGNE
Physico-chemical properties
Protein length: 166 AA
Molecular weight: 17726.3 Da
Isoelectric point: 9.15
Hydropathy: 0.01
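The length and hydropathy values above can be recomputed from the sequence. The sketch below assumes that the hydropathy figure is a GRAVY score, that is, the mean Kyte-Doolittle hydropathy per residue. The record does not say which method was used, so the result is not guaranteed to match the listed value. The names `KYTE_DOOLITTLE`, `SEQ_3NP9Y` and `gravy` are illustrative only.

```python
# Kyte-Doolittle hydropathy index per amino acid.
# Assumption: the record's "hydropathy" is a GRAVY score on this scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

# Sequence of 3np9y, copied from the record above.
SEQ_3NP9Y = (
    "MKPLILILAFLCIAASPRDVTDTLIRMDRLLGDTEALGAVPTLKQDGRVTSGFGRPRPGH"
    "KHAGLDIVNAPGTPIRAVGGGTVTHSGWEPGGYGINIVVHYGNLDVLYGHLRLAYVRVGD"
    "KVKRGQVIGEMGATGRCSPVGATHLHLEYRINGVPCDPMRFIIGNE"
)


def gravy(seq: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy per residue (GRAVY)."""
    if not seq:
        raise ValueError("sequence must not be empty")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


if __name__ == "__main__":
    print(f"length: {len(SEQ_3NP9Y)} AA")
    print(f"GRAVY:  {gravy(SEQ_3NP9Y):.2f}")
```

Molecular weight and isoelectric point depend on tabulated residue masses and pKa sets that differ between tools, so they are left out of this sketch.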
Representative Protein Details
Accession: 4H0aV
Protein name: 4H0aV
Sequence length: 177 AA
Molecular weight: 18881.59970 Da
Isoelectric point: 9.28959
Sequence
MKRLILILLLSPALLGSRTFETIDLILDLARSREAVRELARAVDLGSVPSIAPAPGRISSRFGSRVHPVWRCRSFHSGLDIANAEGTPVCAVGGGVVVFAGGGRRYSGYGNVVVIEHSPRVRTMVAHLGAVFVAEGDRVRRGDVVGTMGSTGISTGPHVHYEIIVEGIHVDPEAWIL
Other Proteins in cluster: phalp2_20604
Total (incl. this protein): 4
Avg length: 176.3 AA
Avg pI: 8.93

Protein ID   Length (AA)   pI
4H0aV        177           9.28959
3ndVO        181           8.96402
4ALGA        181           8.31534
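The table lists only the three other members, while the totals also count this protein. As a quick consistency check, the sketch below assumes the averages are plain arithmetic means over all four members, taking the 3np9y values (166 AA, pI 9.15) from its physico-chemical properties in this record. The names `MEMBERS` and `cluster_averages` are illustrative.

```python
# (accession, length in AA, pI) for all four members of phalp2_20604.
# The 3np9y row is taken from this record's physico-chemical properties.
MEMBERS = [
    ("3np9y", 166, 9.15),
    ("4H0aV", 177, 9.28959),
    ("3ndVO", 181, 8.96402),
    ("4ALGA", 181, 8.31534),
]


def cluster_averages(members):
    """Return (mean length, mean pI) over the given members."""
    n = len(members)
    avg_len = sum(length for _, length, _ in members) / n
    avg_pi = sum(pi for _, _, pi in members) / n
    return avg_len, avg_pi


if __name__ == "__main__":
    avg_len, avg_pi = cluster_averages(MEMBERS)
    print(f"mean length: {avg_len} AA")
    print(f"mean pI:     {avg_pi:.4f}")
```

Under that assumption the mean length is 176.25 AA and the mean pI is about 8.93, which agrees with the reported averages after rounding.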
Similar Clusters (pHMM search)
#    Cluster                 # Members   Identity (%)   Alignment Length   E-value
1    phalp2_21998 (4ZChj)    22          50.7           126                4.127E-40
2    phalp2_15404 (1Utrm)    46          41.7           134                7.903E-34
3    phalp2_19030 (17y4D)    9           42.1           121                8.794E-32
4    phalp2_29593 (84pla)    1           39.5           144                2.019E-27
5    phalp2_2246 (4s1ni)     19          39.8           128                4.637E-26
6    phalp2_14848 (7w0BH)    74          42.7           117                1.623E-25
7    phalp2_27075 (2pChT)    5           42.7           124                3.716E-24
8    phalp2_35838 (4GGll)    4           41.6           120                5.082E-24
9    phalp2_985 (2pMee)      2           44.2           122                4.540E-23
10   phalp2_3247 (2k2so)     54          41.7           127                6.206E-23

Domains

Representative sequence (used for alignment): 4H0aV (177 AA)
Member sequence: 3np9y (166 AA)
[Figure: domain architecture on a 1 to 177 AA axis (representative). Legend: EAD, CBD, Linker, Disordered, Unannotated.]
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Pfam accessions: PF01551

Taxonomy

        Name                       Taxonomy ID       Lineage
Phage   Unknown from Metagenome    UNKNOWN_ENVHOG    No lineage information
Host    No host information

Coding sequence (CDS)


No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID: 3np9y
Method: AlphaFold v2
Resolution: 81.53
Chain position: -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
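The confidence legend amounts to a simple threshold lookup. The sketch below maps a per-residue pLDDT score to the band names used in the legend. The legend uses strict inequalities and leaves the boundary values 90, 70 and 50 unassigned, so placing them in the lower band here is an assumption, as is the function name `plddt_band`.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the legend's confidence band.

    Assumption: scores exactly on a boundary (90, 70, 50) fall into
    the lower band, since the legend does not assign them.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must lie between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.0, 80.0, 60.0, 10.0):
        print(score, plddt_band(score))
```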

The structures below correspond to the cluster representative (4H0aV) rather than this protein.
PDB ID: 4H0aV
Method: AlphaFold v2
Resolution: 88.35
Chain position: -