Protein

Protein accession
4cd5S [EnVhog]
Representative
6GxQK
Source
EnVhog (cluster: phalp2_19950)
Protein name
4cd5S
Lysin probability
99%
PhaLP type
endolysin
Probability: 95% (predicted by ML model)
Protein sequence
MKVKLTEQQLVNLAKNIKNNTPTPQVGDIDFEKEAPNLTKLIKTLMSNKDAGINNITKNILPNTDKSVSKSKFARIIPKGNKMMHPLGRKEPITSPFGSRNTGIPGATKNHKGIDISTRSGSPVYAPLDGVVLNSQDTSPNPCGGFIKLDHVNLETKFCHLRRLNVREGDKVKKGQIIGYSGGGRNDPMHGNSSGPHLHYEILDKSGIAMNPVSVEPNLG
Physico-chemical properties
protein length: 220 AA
molecular weight: 23878,1 Da
isoelectric point: 9,74
hydropathy: -0,60
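
The length and hydropathy values above can be re-derived from the protein sequence. The following is a minimal sketch, assuming the reported hydropathy is a GRAVY score (mean Kyte-Doolittle value per residue); the page does not say which scale it uses, so that is an assumption. The names `sequence_length` and `gravy` are hypothetical helpers, not part of any database API.

```python
# Kyte-Doolittle hydropathy index per amino acid (standard published scale).
# ASSUMPTION: the page's "hydropathy" is a GRAVY score on this scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def sequence_length(seq: str) -> int:
    """Number of residues, ignoring surrounding whitespace."""
    return len(seq.strip())


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    seq = seq.strip().upper()
    if not seq:
        raise ValueError("empty sequence")
    # Raises KeyError on non-standard residue codes (e.g. X, B, Z).
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Sequence of 4cd5S as listed on this page.
SEQ_4CD5S = (
    "MKVKLTEQQLVNLAKNIKNNTPTPQVGDIDFEKEAPNLTKLIKTLMSNKDAGINNITKNI"
    "LPNTDKSVSKSKFARIIPKGNKMMHPLGRKEPITSPFGSRNTGIPGATKNHKGIDISTRS"
    "GSPVYAPLDGVVLNSQDTSPNPCGGFIKLDHVNLETKFCHLRRLNVREGDKVKKGQIIGY"
    "SGGGRNDPMHGNSSGPHLHYEILDKSGIAMNPVSVEPNLG"
)

if __name__ == "__main__":
    print(sequence_length(SEQ_4CD5S), round(gravy(SEQ_4CD5S), 2))
```

If the printed hydropathy differs from the value on the page, the database likely uses a different scale or rounding rule.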
Representative Protein Details
Accession
6GxQK
Protein name
6GxQK
Sequence length
219 AA
Molecular weight
22951,69070 Da
Isoelectric point
8,98536
Sequence
MKLKLTEGQLNKLMVTLDEQEAPKKEGGDLMGSLEKEAPALAAFAKFMRDPIGSAVDKLNGGNTDSSSSTFANDIPPGTELMNPLGKRTKVTSGFGPRNVGGSASKNHKGVDLPAVSGSPVYAPADGRVVTAKDTSPNGCGGFVQIDHTGVGLKTKFCHLKRWTVSQGQDVKKGQLIGYSGGGPNDPYRGNSMGAHLHYEVLNSASIAMNPKNVHSDMA
Other Proteins in cluster: phalp2_19950
Total (incl. this protein): 13; Avg length: 220,5; Avg pI: 8,34

Protein ID Length (AA) pI
6GxQK 219 8,98536
274aE 212 5,10956
39JfO 202 8,64916
3nb4v 226 9,19398
4557e 223 9,46120
5JFHW 204 6,19377
5bQGJ 246 8,94049
5lzPN 246 8,78615
5w7Zs 238 8,87325
6B3TI 203 9,04996
6Mk11 221 6,37537
7Ei9J 206 9,04996
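
The table lists 12 members, while the summary line counts 13 including this protein, so the query (4cd5S, 220 AA, pI 9,74) appears to be the entry not listed. A small sketch, under that assumption, that parses the decimal-comma values and recomputes the summary averages; `parse_decimal_comma` and `parse_members` are hypothetical helper names.

```python
# Rows copied from the cluster table: protein ID, length (AA), pI.
ROWS = """\
6GxQK 219 8,98536
274aE 212 5,10956
39JfO 202 8,64916
3nb4v 226 9,19398
4557e 223 9,46120
5JFHW 204 6,19377
5bQGJ 246 8,94049
5lzPN 246 8,78615
5w7Zs 238 8,87325
6B3TI 203 9,04996
6Mk11 221 6,37537
7Ei9J 206 9,04996
"""


def parse_decimal_comma(text: str) -> float:
    """Convert a decimal-comma number such as '8,98536' to a float."""
    return float(text.replace(",", "."))


def parse_members(block: str) -> list[tuple[str, int, float]]:
    """Parse whitespace-separated rows into (id, length, pI) tuples."""
    members = []
    for line in block.splitlines():
        if not line.strip():
            continue
        protein_id, length, pi = line.split()
        members.append((protein_id, int(length), parse_decimal_comma(pi)))
    return members


members = parse_members(ROWS)
# ASSUMPTION: the query protein is the 13th member and is not in the table.
members.append(("4cd5S", 220, parse_decimal_comma("9,74")))

avg_length = sum(m[1] for m in members) / len(members)
avg_pi = sum(m[2] for m in members) / len(members)

print(len(members), round(avg_length, 1), round(avg_pi, 2))  # prints: 13 220.5 8.34
```

The recomputed values agree with the summary line (13 members, average length 220,5, average pI 8,34).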
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_21771 (3Z3Hh) 5 38,5% 231 4.079E-54
2 phalp2_38254 (7cjl) 3 43,9% 189 1.166E-51
3 phalp2_39873 (180yL) 3 41,0% 134 3.408E-27
4 phalp2_5481 (3dacM) 12 39,7% 136 7.626E-26
5 phalp2_2246 (4s1ni) 19 36,8% 160 3.772E-23
6 phalp2_19030 (17y4D) 9 34,7% 144 3.293E-22
7 phalp2_3247 (2k2so) 54 36,0% 136 5.317E-21
8 phalp2_21998 (4ZChj) 22 34,7% 141 6.277E-20
9 phalp2_17286 (4a6ed) 5 38,1% 144 5.424E-19
10 phalp2_18623 (3ebV) 83 35,4% 141 2.168E-17
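
The similar-cluster hits mix decimal-comma percentages with scientific-notation E-values, so they need light parsing before they can be ranked or filtered. A minimal sketch, with hypothetical helper names (`parse_identity`, `parse_hit`) and an arbitrary illustrative cutoff of 150 aligned positions:

```python
# Hits copied from the table: cluster, members, identity, alignment length, E-value.
HITS = [
    ("phalp2_21771", 5, "38,5%", 231, "4.079E-54"),
    ("phalp2_38254", 3, "43,9%", 189, "1.166E-51"),
    ("phalp2_39873", 3, "41,0%", 134, "3.408E-27"),
    ("phalp2_5481", 12, "39,7%", 136, "7.626E-26"),
    ("phalp2_2246", 19, "36,8%", 160, "3.772E-23"),
    ("phalp2_19030", 9, "34,7%", 144, "3.293E-22"),
    ("phalp2_3247", 54, "36,0%", 136, "5.317E-21"),
    ("phalp2_21998", 22, "34,7%", 141, "6.277E-20"),
    ("phalp2_17286", 5, "38,1%", 144, "5.424E-19"),
    ("phalp2_18623", 83, "35,4%", 141, "2.168E-17"),
]


def parse_identity(text: str) -> float:
    """Convert a value such as '38,5%' to 38.5."""
    return float(text.rstrip("%").replace(",", "."))


def parse_hit(raw: tuple) -> dict:
    """Turn one raw table row into a dict with numeric fields."""
    cluster, members, identity, aln_len, evalue = raw
    return {
        "cluster": cluster,
        "members": members,
        "identity": parse_identity(identity),
        "alignment_length": aln_len,
        "evalue": float(evalue),  # float() accepts '4.079E-54' directly
    }


hits = [parse_hit(h) for h in HITS]

# Hit with the highest percent identity.
best_identity = max(hits, key=lambda h: h["identity"])
# Illustrative filter: hits aligned over at least 150 positions.
long_hits = [h["cluster"] for h in hits if h["alignment_length"] >= 150]

print(best_identity["cluster"], long_hits)
```

Sorting `hits` by `evalue` reproduces the order of the table, which is ranked from the most to the least significant hit.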

Domains

Annotated regions: Disordered region, PET_M23
Representative sequence (used for alignment): 6GxQK (219 AA)
Member sequence: 4cd5S (220 AA)
Axis: 1 to 219 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated
Pfam accessions: PF01551

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (6GxQK) rather than this protein.
PDB ID
6GxQK
Method AlphaFoldv2
Resolution 77.22
Chain position -
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
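
The confidence legend maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows; `plddt_band` is a hypothetical helper. The legend uses strict inequalities, so assigning the boundary values (exactly 90, 70, or 50) to the lower band is an assumption. Treating the 77.22 listed under "Resolution" as a mean pLDDT for the AlphaFold model is also an assumption, since the page does not say so.

```python
def plddt_band(score: float) -> str:
    """Map a pLDDT score (0-100) to the confidence band in the legend.

    ASSUMPTION: boundary values (exactly 90, 70, 50) fall into the lower
    band; the legend only gives strict inequalities.
    """
    if not 0 <= score <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if score > 90:
        return "Very high"
    if score > 70:
        return "High"
    if score > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    # ASSUMPTION: 77.22 is a mean pLDDT rather than a resolution.
    print(plddt_band(77.22))
```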