Protein

Protein accession
7tpml [EnVhog]
Representative
6Dsfp
Source
EnVhog (cluster: phalp2_5975)
Protein name
7tpml
Lysin probability
89%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSALDIIPLPGQPGKWGRRVLVEAWIEAGSPPVNDGGAGRLYGLQKYFWDGWADRLPGFNPADNPDDESQRLAHVRFGALDITPTPERVRRLENAGLIRPYKYEPWHFELPNIRNYPIVRALPAAATGSAGAQEGIIMAEAIVSAPNGIVVHLRTGGKTNFTKPDEYNTFRDQVAFLRNIGATDLMPLPELAKVPEVSWDTFNFLCAYMGAPNK
Physico‐chemical
properties
protein length:214 AA
molecular weight:23653,6 Da
isoelectric point:5,95
hydropathy:-0,31
Representative Protein Details
Accession
6Dsfp
Protein name
6Dsfp
Sequence length
202 AA
Molecular weight
22982,58950 Da
Isoelectric point
5,16186
Sequence
MTTLYPIPWQPRYLLTRDTLDLLGAASERAGHDIEVVDAFRVYDEQKALYNGYINHIPGFNLASNPDDPNAQNNHLRAAAVDIKNHADVPYMLAVGFTQDSVEWWHFNNPNWRNMPIIHDYTTVAALAAQPIQEEEMAQKLYEIISAHESASKYLRHPGGVSPIKNTVELAVVERFVSLVNEEKPIDMFDTQLQTINAIQKR
Other Proteins in cluster: phalp2_5975
Total (incl. this protein): 7 Avg length: 209,6 Avg pI: 6,33

Protein ID Length (AA) pI
6Dsfp 202 5,16186
4HSt 202 9,59252
6QheX 214 6,31512
7gTuT 214 6,30909
7olku 207 4,68100
7psvi 214 6,30818
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_11278
6RB3h
6 47,6% 191 9.613E-57
2 phalp2_13546
7RceN
12 30,7% 166 5.374E-42
3 phalp2_24160
2gwu3
4 36,2% 185 1.461E-30
4 phalp2_37114
1fUNl
3 28,9% 152 5.756E-09

Domains

Domains
Unannotated
Representative sequence (used for alignment): 6Dsfp (202 AA)
Member sequence: 7tpml (214 AA)
1 202 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6Dsfp) rather than this protein.
PDB ID
6Dsfp
Method AlphaFoldv2
Resolution 74.12
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50