Protein

Protein accession
1TpmQ [EnVhog]
Representative
3KeKt
Source
EnVhog (cluster: phalp2_22675)
Protein name
1TpmQ
Lysin probability
99%
PhaLP type
endolysin
Probability: 98% (predicted by ML model)
Protein sequence
MKKFMLFLGVLLFGVTSVNTAQVGQIWEPPIVPSIYKEKQNIPLEEHLETVAPVIDPNELECMAKNIYFEAAIESTAGKIAVAQVTMNRVRSRHYPDSICKVVYQGKHHSNGFPVRDRCQFSWYCDGKGDEPRPTPAWKDSVEIAEYIIRTPSLLDITDGATHYHADWMEKFPKWAHQKKKLVKIDTHIFYKKRNNFNF
Physico‐chemical
properties
protein length:199 AA
molecular weight:23012,2 Da
isoelectric point:8,58
hydropathy:-0,41
Representative Protein Details
Accession
3KeKt
Protein name
3KeKt
Sequence length
149 AA
Molecular weight
16257,17940 Da
Isoelectric point
5,22262
Sequence
MQSIKTALLTMLLFITMGVSTAGSGSTGLGDWYTVHPKFTTSNGLMDVATSSVEGNIFLDMAELECMAKNIFFEAAVESTAGRLAVAQVTLNRVSSDQYPNSVCGVVYEGPHHASGHPKRDMCQFSWYCDGKHDEPQEGRLWRSSQELA
Other Proteins in cluster: phalp2_22675
Total (incl. this protein): 24 Avg length: 188,2 Avg pI: 8,18

Protein ID Length (AA) pI
3KeKt 149 5,22262
1TN2h 199 8,57612
1TqLL 190 9,11108
1W3r2 209 8,75747
3Lghf 150 9,03313
3X0rF 199 8,61583
3oN0U 199 7,70728
40xSY 201 8,28253
4vcGm 105 6,53509
57QzU 179 9,05595
5I95p 207 6,99645
6RGb4 156 6,06986
8ACYl 209 8,75747
8CYQY 209 9,02262
8G4sy 199 8,57747
8eC1z 209 8,90503
8uTdy 150 9,03017
8yiHp 199 7,70728
8yl2R 201 7,04317
8z0XF 199 8,58585
V3ux 203 8,55839
ibkH 198 8,56728
vS8k 198 8,93772
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_22664
28zhl
1 44,7% 134 1.438E-38
2 phalp2_9593
1r3GZ
4 27,0% 144 2.347E-12
3 phalp2_1335
14NLd
89 25,2% 111 4.848E-08

Domains

Domains
Representative sequence (used for alignment): 3KeKt (149 AA)
Member sequence: 1TpmQ (199 AA)
1 149 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF07486

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (3KeKt) rather than this protein.
PDB ID
3KeKt
Method AlphaFoldv2
Resolution 81.72
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50