Protein

Protein accession
3yQN2 [EnVhog]
Representative
4kiLL
Source
EnVhog (cluster: phalp2_21830)
Protein name
3yQN2
Lysin probability
99%
PhaLP type
endolysin
Probability: 99% (predicted by ML model)
Protein sequence
MTAIEERIVANGHGALSPSYFCVHTTANTGATAENHASLWSRQPDYAVHLVSDWEKCLHTVPYDRLCWQVGNGNGYVEGIEICEAASRAEFEEGIAIAAKAVAERLKAHGWGTDRLVTHKWCAEKWGGSDHTDPYPYFEKWGYSWERFVSAVESAMSGGEWVEDGGRWWYKHADGSYTSDGWEEIDGRWYLFDADGWMLTGWQQRGGSWYYLNESHDGGFGAMLTGWQLVGGKWYFLDESGAMATGWKSDGSKWYYMDGDGAMQTGWVQDGGKWYWMNPDGSMSADEVKSVGDTFYAFDESGKMLSHALKVQGVDA
Physico‐chemical
properties
protein length:316 AA
molecular weight:35436,6 Da
isoelectric point:4,75
hydropathy:-0,54
Representative Protein Details
Accession
4kiLL
Protein name
4kiLL
Sequence length
309 AA
Molecular weight
34922,25980 Da
Isoelectric point
4,92535
Sequence
MIRIRGVMNIIEDIVSWGHGELYPSYVVIHETANPGATARNHRDYWASNDTYAVHYCGDWTGDVYHCVPDDRICWQVGNGNPYVVGIELCHATNQEDFESVWRVGIEWSAMMLNRYGWGIDRLISHNDCTNWWGGSDHTDPISYFENHGKSWEQFKQEVADYMASGGFEEEAMYDVNVPAAGTPVHRLFNAGSGEHFYTCAEDEKDGLVANGWAYEGVAWRCPDPEVAVFRMWMPGGKHFFTASFDEAQGLTKNRWKCEGVPFFAKREGTPVRRAFNKYTGDHLLTTSDTDMANAIAAGYADEGVAFHV
Other Proteins in cluster: phalp2_21830
Total (incl. this protein): 4 Avg length: 313,3 Avg pI: 6,88

Protein ID Length (AA) pI
4kiLL 309 4,92535
3VGNP 329 8,82071
A0A9E6YZE7 299 9,03023
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_14323
3WMg9
12 23,5% 285 9.491E-18
2 phalp2_25741
4Lci3
6 20,1% 218 8.079E-16
3 phalp2_32244
7rT4s
46 26,9% 208 3.972E-12
4 phalp2_30009
2eOPi
1 25,4% 212 5.411E-11
5 phalp2_39033
8owcJ
65 23,3% 248 2.289E-08

Domains

Domains
Representative sequence (used for alignment): 4kiLL (309 AA)
Member sequence: 3yQN2 (316 AA)
1 309 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD CBD Linker Disordered Unannotated
Pfam accessions: PF01510, PF18885

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

PDB ID
3yQN2
Method AlphaFoldv2
Resolution 94.44
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50

The structures below correspond to the cluster representative (4kiLL) rather than this protein.
PDB ID
4kiLL
Method AlphaFoldv2
Resolution 89.61
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50