Protein

Protein accession
30FTv [EnVhog]
Representative
6SdDJ
Source
EnVhog (cluster: phalp2_29073)
Protein name
30FTv
Lysin probability
96%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
VGSNLVAEAPSIDTEVLAVKVSKYSPDYPELIENLNCCENYCRWDQKKIIDSNNKYSYGGLMFQMTTFLHYGHEGGILPEWVDETNFENYIYDKDLQTLIAEYMITIGVAKTTVGWYNCWRSYNLSQYL
Physico‐chemical
properties
protein length:129 AA
molecular weight:14964,7 Da
isoelectric point:4,56
hydropathy:-0,33
Representative Protein Details
Accession
6SdDJ
Protein name
6SdDJ
Sequence length
123 AA
Molecular weight
14226,00100 Da
Isoelectric point
4,77012
Sequence
MLPLAISLMMSATFTPSVLTPQIAEVATTTPQDIWVQHLHECENPTNIPRILDTNGQYSYGYVSFQMPTWLYYGKQFGATEENIKDDELQKQIAEYILQTKGWTDWWNCGRVTIKWYGSYPLP
Other Proteins in cluster: phalp2_29073
Total (incl. this protein): 10 Avg length: 136,0 Avg pI: 6,44

Protein ID Length (AA) pI
6SdDJ 123 4,77012
1NUeo 121 9,56268
1l4R6 152 5,63970
2V2rd 136 7,70944
4K3Jl 134 4,80815
5IY4k 150 4,96309
7ZKOA 105 5,07205
8kSm3 152 8,46020
qEkT 158 8,86577
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_12892
2v9z8
3 42,1% 95 4.944E-26
2 phalp2_22614
1MavD
11 34,2% 105 3.000E-24
3 phalp2_17923
UMZJ
12 32,0% 103 1.209E-21
4 phalp2_34952
4Dsud
16 31,5% 111 7.557E-17
5 phalp2_16707
Ilfh
7 31,8% 91 3.319E-15
6 phalp2_25299
8rpez
24 30,1% 93 1.453E-13
7 phalp2_33867
1FC4O
1 30,7% 117 1.991E-13
8 phalp2_30772
8yRd
3 23,8% 126 4.180E-11
9 phalp2_25243
41xBv
4 24,7% 105 4.633E-09
10 phalp2_34389
4fOxg
1 23,4% 94 4.633E-09

Domains

Domains
Disordered region
Transgly
Representative sequence (used for alignment): 6SdDJ (123 AA)
Member sequence: 30FTv (129 AA)
1 123 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF06737

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
30FTv
Method AlphaFoldv2
Resolution 87.77
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (6SdDJ) rather than this protein.
PDB ID
6SdDJ
Method AlphaFoldv2
Resolution 89.31
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50