Protein

Protein accession: 4SY6J [EnVhog]
Representative: 4SW3E
Source: EnVhog (cluster: phalp2_6026)
Protein name: 4SY6J
Lysin probability: 99%
PhaLP type: endolysin (probability: 99%, predicted by ML model)
Protein sequence
VSVYQYPTINRIIPDVPKNGYRHGVGEYEGVVVHFTDNYTSTVNGEIAYMTSNWRNAFVHGFVSGSNFYQTADTNYSCWGCGPVGNPRFVQFEICVSHTQEDFNQTIDVAAEKAARFLAAKKLGVKKASEVGAAGTLWAHYDVSHILGGSDHEDPIDYFAKWGLTWNAFCAKVETYYNGILGINQPAPVKTHLPLPTGIIRQGNKGDNVKQLQTALNSIGYPCTADGDFGPKTKAALVAFQKNVSISADGIYGPTTRAYMLKYV
Physico-chemical properties
Protein length: 264 AA
Molecular weight: 28936.2 Da
Isoelectric point: 7.19
Hydropathy: -0.24
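A hydropathy value like the one above can be recomputed from a sequence. The following is a minimal Python sketch, assuming the reported hydropathy is a GRAVY score (the mean Kyte-Doolittle hydropathy over all residues). The page does not state which scale it uses, and the function name `gravy` is illustrative only.

```python
# Kyte-Doolittle hydropathy index for the 20 standard amino acids.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy; assumes only standard residues."""
    seq = sequence.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Short illustrative peptide (the first residues of 4SY6J), not the full record.
peptide = "VSVYQYPTINR"
print(len(peptide), round(gravy(peptide), 2))
```

Passing the full protein sequence instead of the short peptide would give values to compare against the length and hydropathy fields, provided the GRAVY assumption holds.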
Representative Protein Details
Accession: 4SW3E
Protein name: 4SW3E
Sequence length: 294 AA
Molecular weight: 32850.21230 Da
Isoelectric point: 5.97931
Sequence
MSVYHYPVRKDWIPYVPKNPYADGIGAYRGVVLHYTENEQDTAASESQYEHQNWQSAFVHEFVDHKEVVQTADPNYRCWGCGAKGNPYYVQIEKCSSHSQSEFDTSFDAWCERAAEYLYRRKLGVIPANDTNKGVGATLLGHFQISKYMGGTDHTDPLSHLAKWNKTWDDVVNRVKAIYNALAAEEVANQAAAVENQRLYLTNLLADPKSTYGTKGWAYSQLKTLPISTVRSGDSNTVVKELQAVLTYLGFNCNGIDGIFGNGTYNAVIAFQRQIGLIVDGVVGNSTWAALFSY
Other Proteins in cluster: phalp2_6026
Total (incl. this protein): 3
Avg length: 269.0
Avg pI: 5.88

Protein ID   Length (AA)   pI
4SW3E        294           5.97931
6x2xI        249           4.47297
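The cluster averages can be checked against the per-protein values on this page. This is a small Python sketch, assuming the averages are plain arithmetic means over all three members, including this protein (4SY6J, 264 AA, pI 7.19).

```python
# (protein ID, length in AA, pI) for the three members listed on this page.
members = [
    ("4SY6J", 264, 7.19),
    ("4SW3E", 294, 5.97931),
    ("6x2xI", 249, 4.47297),
]

avg_length = sum(length for _, length, _ in members) / len(members)
avg_pi = sum(pi for _, _, pi in members) / len(members)

print(f"{avg_length:.1f} {avg_pi:.2f}")  # 269.0 5.88
```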
Similar Clusters (pHMM search)
#    Cluster                # Members   Identity (%)   Alignment Length   E-value
1    phalp2_35269 (6SkGy)   26          42.2%          187                3.250E-57
2    phalp2_22283 (7wMS5)   3           40.0%          180                1.848E-47
3    phalp2_22248 (7gCZr)   4           31.8%          286                4.877E-40
4    phalp2_2331 (4MNkD)    32          28.0%          324                5.252E-36
5    phalp2_7313 (4e1GY)    5           26.1%          302                4.017E-32
6    phalp2_5528 (3sIWi)    1           31.0%          187                3.920E-26
7    phalp2_36672 (45lpM)   229         30.6%          251                8.251E-25
8    phalp2_8663 (7V0AW)    3           30.0%          216                7.320E-21
9    phalp2_21830 (4kiLL)   4           29.3%          201                1.640E-18
10   phalp2_17898 (EGF4)    130         27.6%          318                9.878E-18
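The pHMM hits above can be loaded into records for filtering. This Python sketch copies the rows from this page and keeps hits at or above an identity cutoff. The 30% cutoff and the names `parse_percent` and `hits` are illustrative assumptions, not part of the database.

```python
def parse_percent(text: str) -> float:
    """Parse a percentage that may use a decimal comma, e.g. '42,2%'."""
    return float(text.strip().rstrip("%").replace(",", "."))


# (cluster, protein ID shown with the cluster, members, identity, aln length, E-value)
rows = [
    ("phalp2_35269", "6SkGy", 26, "42,2%", 187, "3.250E-57"),
    ("phalp2_22283", "7wMS5", 3, "40,0%", 180, "1.848E-47"),
    ("phalp2_22248", "7gCZr", 4, "31,8%", 286, "4.877E-40"),
    ("phalp2_2331", "4MNkD", 32, "28,0%", 324, "5.252E-36"),
    ("phalp2_7313", "4e1GY", 5, "26,1%", 302, "4.017E-32"),
    ("phalp2_5528", "3sIWi", 1, "31,0%", 187, "3.920E-26"),
    ("phalp2_36672", "45lpM", 229, "30,6%", 251, "8.251E-25"),
    ("phalp2_8663", "7V0AW", 3, "30,0%", 216, "7.320E-21"),
    ("phalp2_21830", "4kiLL", 4, "29,3%", 201, "1.640E-18"),
    ("phalp2_17898", "EGF4", 130, "27,6%", 318, "9.878E-18"),
]

MIN_IDENTITY = 30.0  # illustrative cutoff, in percent

# Keep hits at or above the cutoff, ordered by ascending E-value.
hits = sorted(
    (r for r in rows if parse_percent(r[3]) >= MIN_IDENTITY),
    key=lambda r: float(r[5]),
)

for cluster, _, members, identity, _, evalue in hits:
    print(cluster, members, identity, evalue)
```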

Domains

Representative sequence (used for alignment): 4SW3E (294 AA)
Member sequence: 4SY6J (264 AA)
[Figure: domain architecture; axis from 1 to 294 AA (representative)]
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01471, PF01510

Taxonomy

        Name                      Taxonomy ID      Lineage
Phage   Unknown from Metagenome   UNKNOWN_ENVHOG   No lineage information
Host    No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4SW3E) rather than this protein.
PDB ID: 4SW3E
Method: AlphaFoldv2
Resolution: 90.80
Chain position: -
Model Confidence:
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
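The confidence bands above map a pLDDT score to a label. Below is a minimal Python sketch of that mapping. The legend does not say which band the exact boundary values 90, 70, and 50 belong to, so placing them in the lower band is an assumption, as is the function name `plddt_band`.

```python
def plddt_band(plddt: float) -> str:
    """Return the confidence label for a pLDDT score in [0, 100].

    Boundary values (exactly 90, 70, 50) fall into the lower band here;
    that choice is an assumption, since the legend leaves it open.
    """
    if not 0.0 <= plddt <= 100.0:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


for score in (95.0, 80.0, 60.0, 30.0):
    print(score, plddt_band(score))
```

If the value 90.80 listed for the AlphaFold model is read as a mean pLDDT (the page labels it "Resolution", so this is only a guess), it would fall in the "Very high" band.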