Protein

Protein accession
7TKdQ [EnVhog]
Representative
4FfqV
Source
EnVhog (cluster: phalp2_13447)
Protein name
7TKdQ
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MNPRFSKWTEDNVAICRYLHENAREVFLELVLRIAEQGIYIRLTSGHRTAEEQKKEYDEGDSWVLCPDSYHCHGLAVDIVPMERVSDLLYRAIWGKDAWERGVYEKMAKVAYKLGIAWGYQEWGVDRGHFHYRDNKTIYQIAEGNFPKKPEIAQIPYHRETRRVIDRLQNRDIITPVLFPYLYVKSR
Physico‐chemical
properties
protein length:187 AA
molecular weight:22236,1 Da
isoelectric point:7,74
hydropathy:-0,59
Representative Protein Details
Accession
4FfqV
Protein name
4FfqV
Sequence length
186 AA
Molecular weight
21099,21970 Da
Isoelectric point
9,56106
Sequence
MKVALSKFTAANFAKLRYLRPEAQRLFTEFFCRIAMAGIYIRIPDDGGKRTTEEQLDQYRKGRPWVAGYDPKRHNVPPVTDVKCPYSWHCHGLAVDIAPLTRLSTLLYSVWYGEAPFNTIARIAKELGITWGYAIWGQDKPHFHYSGGLTLEQVAAGASLPMPQFKPAEKPKVLQRALERLNVPLP
Other Proteins in cluster: phalp2_13447
Total (incl. this protein): 3 Avg length: 183,0 Avg pI: 9,01

Protein ID Length (AA) pI
4FfqV 186 9,56106
2rRQq 176 9,72488
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_714
1GW8q
36 30,7% 156 4.010E-17
2 phalp2_1337
15l6q
1516 34,0% 150 1.889E-16
3 phalp2_35414
f7cN
32 31,2% 147 2.658E-14
4 phalp2_26137
aKO9
109 30,4% 164 3.619E-14
5 phalp2_6580
1dlN9
3 30,0% 150 6.706E-14
6 phalp2_29857
41cMd
95 28,2% 156 2.300E-13
7 phalp2_627
1bpyM
18 31,1% 170 4.257E-13
8 phalp2_25286
7YOmj
5 28,2% 152 4.257E-13
9 phalp2_2464
5pI5e
1 28,1% 153 2.692E-12
10 phalp2_34371
4aKCB
348 28,6% 143 2.305E-11

Domains

Domains
Representative sequence (used for alignment): 4FfqV (186 AA)
Member sequence: 7TKdQ (187 AA)
1 186 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF13539

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
7TKdQ
Method AlphaFoldv2
Resolution 93.75
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4FfqV) rather than this protein.
PDB ID
4FfqV
Method AlphaFoldv2
Resolution 94.07
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50