Protein

Protein accession
1qwlk [EnVhog]
Representative
3TCAn
Source
EnVhog (cluster: phalp2_31497)
Protein name
1qwlk
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MSITTDFISSIKAQAIANRAAGSCLASVCIAQAICESASGTSGLTQVSNNLFGIKGDYNGAYVDYPTKEWVNGQYVSTTAKFRKYPSMTESIADHMNFLKRIKLASGALRYQAVLDAADYKTATQALKDAGYATSPAYPDTLNKIIETYNLAQYDTTASADAIEVDGVRGPLTIKQWQQVIGTFADGVISKPVSQLIKADQAFLNSAIGAGLKVDGSEGPLTIKARQRYLGVDADGVLGPVTNKAHQAALNRAAIGSGRY
Physico‐chemical
properties
protein length:260 AA
molecular weight:27562,8 Da
isoelectric point:8,62
hydropathy:-0,11
Representative Protein Details
Accession
3TCAn
Protein name
3TCAn
Sequence length
311 AA
Molecular weight
34854,09460 Da
Isoelectric point
9,16046
Sequence
MNNTEFIKTIGMMAQKDMQTSHILASLTIAQAILESGWGRSALSQAPNYNLFGIKGEYNGQYCLFNTQEYVNGKWITIKDKFRKYPSWLQSIQDHSALFNRYDRYANLRGNYNYRDVCIKVREDGYATDPSYSSKLINLIETYNLTQYDTETAPASEDEYIVQAGDTLNKISGMFNVSVDDLVKWNNIKNKNLIYVGQVLKVKGNSTPAPTPTAQGTYVVKAGDTLSKIASKYGTTYQELARINNIANPNLILVGQVLKVPTNNTETTYVVKAGDNLTKIAKQFNTSVDSLVAKNNIKNKNLIYAGQVLKI
Other Proteins in cluster: phalp2_31497
Total (incl. this protein): 6 Avg length: 324,8 Avg pI: 7,89

Protein ID Length (AA) pI
3TCAn 311 9,16046
8aUO0 437 9,91209
oq2v 313 5,27275
orbp 318 8,73490
Q9ZXE4 310 5,66329
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_11359
7wNq4
16 48,7% 285 1.688E-88
2 phalp2_33491
7wA0y
13 40,7% 277 1.168E-59
3 phalp2_21235
1gIfg
3 38,6% 277 1.530E-56
4 phalp2_2210
4f9X6
94 35,8% 215 7.168E-39
5 phalp2_33707
MX3g
10 32,6% 279 1.849E-36

Domains

Domains
Representative sequence (used for alignment): 3TCAn (311 AA)
Member sequence: 1qwlk (260 AA)
1 311 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01476, PF01832

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
1qwlk
Method AlphaFoldv2
Resolution 84.82
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (3TCAn) rather than this protein.
PDB ID
3TCAn
Method AlphaFoldv2
Resolution 84.04
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50