Protein

Protein accession
1pFeP [EnVhog]
Representative
4fGQC
Source
EnVhog (cluster: phalp2_18130)
Protein name
1pFeP
Lysin probability
98%
PhaLP type
VAL
Probability: 68% (predicted by ML model)
Protein sequence
MASGKIGTLKPQDRLAVFQSAKNLGLDPYEFGALITKESGFRPNVWGGTGGKYRGLIQFGPGARQEVGLPSQDMSIAEQMPYVEKYFQKRGFKPGMDITQAYATVLGGNPKANIYAKDSFGTSVASAVPGMKKGGSLYKQAQATLGDPLNLSTPVVQQPAQQAQPTTASPSQTIVLLGDSKDGDDETNTNIGNYFLQSFLGQPEKASSAFRSNREALFKGLLSSIAGTTESPEYFSQTTM
Physico-chemical properties
protein length: 240 AA
molecular weight: 25624,5 Da
isoelectric point: 8,94
hydropathy: -0,43
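The length and hydropathy values can be recomputed from the protein sequence. The sketch below assumes that "hydropathy" means the GRAVY score (the mean Kyte-Doolittle hydropathy over all residues); the page does not state which scale it uses, so this is an assumption, and the function names `sequence_length` and `gravy` are hypothetical.

```python
# Kyte-Doolittle hydropathy values for the 20 standard amino acids.
# ASSUMPTION: the page's "hydropathy" is a GRAVY score on this scale.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def sequence_length(sequence: str) -> int:
    """Return the number of residues, ignoring surrounding whitespace."""
    return len(sequence.strip())


def gravy(sequence: str) -> float:
    """Return the mean Kyte-Doolittle hydropathy of a protein sequence."""
    residues = sequence.strip().upper()
    if not residues:
        raise ValueError("empty sequence")
    # A KeyError here signals a non-standard residue code (e.g. X or B).
    return sum(KYTE_DOOLITTLE[aa] for aa in residues) / len(residues)


if __name__ == "__main__":
    # First five residues of the member sequence, used as a small example.
    peptide = "MASGK"
    print(sequence_length(peptide), round(gravy(peptide), 2))
```

Passing the full protein sequence to `gravy` gives a value that can be compared with the hydropathy listed above; any difference would suggest the page uses another scale or rounding rule.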
Representative Protein Details
Accession
4fGQC
Protein name
4fGQC
Sequence length
220 AA
Molecular weight
23598,95100 Da
Isoelectric point
5,41695
Sequence
MPLSPEDIAYLRAAAARDGYNPDHALKVFDHESSFRPNVWGGKDNKYFGLFQAGPSERAQYGVDVAHPSVRNQVDAFGRFLHGRGFKPGMGLDDMYSTVLAGSPGHYNRSDGAGTVAQHVAAMNGGKMPNVLGGGMDPVALAALAYSGDEQPAAGGDDPIGKLLQGEQQPQENPIAGLLTQISEQEKQTQEEEQQRRDALQKQILDLHNAQHGLAVKGLL
Other Proteins in cluster: phalp2_18130
Total (incl. this protein): 122; Avg length: 227,4 AA; Avg pI: 8,09

Protein ID Length (AA) pI
4fGQC 220 5,41695
11QQm 227 5,57780
14fsH 241 7,85381
15gT9 231 8,77655
15lwM 210 5,07381
16pWT 226 9,39493
16r0A 216 9,61509
1PdFb 231 9,09489
1TL0c 224 8,75095
1TM0H 252 5,27082
1UQ7o 224 9,19656
1WPli 217 7,88076
1aBtB 243 9,32672
1pRFp 212 5,72530
1rVyb 220 9,55481
1yXqr 226 5,58121
216jI 272 5,54188
22hfE 278 9,44509
2Ii3d 231 8,97956
2IqL9 237 9,15408
2KHB7 219 9,61457
2Mef6 230 8,80053
2OY2f 228 9,13396
2a9jk 218 9,58047
2jKF9 269 5,92003
2qUef 272 5,54188
2s6ud 263 8,82638
2tZlG 225 9,65055
2vfJi 254 4,81451
2xjgp 269 5,14418
3CqGJ 233 7,79811
3D8Q5 241 5,32481
3EtSw 224 9,04506
3FBuw 224 9,24440
3LZPA 193 7,87908
3UxCS 225 9,32743
3oa6H 170 5,69535
3pjzO 184 5,19727
42BwA 244 9,52142
46IaI 220 6,83247
46mVm 230 9,04577
46nKu 210 6,35701
46nND 213 5,41718
498X2 210 6,35513
49Lki 222 8,75218
49sd 259 9,44006
4Ar7J 225 9,35876
4BW2M 226 9,09612
4GiQj 211 8,75095
4RHGR 210 9,51910
4aWCs 220 6,83349
4aYaB 224 6,75272
4aYlQ 220 7,87902
4bgn9 240 9,23982
4e25Y 229 8,75018
4jsGz 228 9,48390
4nyfD 210 5,57439
4pDG8 234 9,44077
4t1l8 242 9,24820
4t2dv 237 9,25851
4tqPB 218 9,61406
53QwW 185 9,51858
58VEL 221 8,79982
5921z 213 9,09657
5BwvM 217 9,48390
5Cpx3 200 9,78754
5HDhX 219 7,87032
5a6sH 222 8,80150
5cXyN 227 9,12313
5cYOX 217 9,39338
5cbYY 213 8,77629
5hKfw 215 9,73622
5j7sa 227 9,09612
5kzlL 211 9,86419
5mlkK 221 5,53574
5moBx 224 9,29971
5mulk 222 5,10195
5mvaN 221 5,71001
5v0S3 217 5,23836
5wrhg 210 6,35701
5xrC0 210 5,09842
6FQq 240 9,36237
6GQCa 207 8,92328
6GYbp 232 9,04545
6Gg8m 221 6,75250
6HKqz 216 5,24211
6L3XN 246 9,13403
6LO9L 219 7,87599
6LWj0 215 9,55526
6LYgF 221 9,78335
6MADu 217 9,65222
6MgFj 224 10,04380
6Mi1Z 223 9,91596
6OkaK 272 5,29685
6VHas 224 9,29971
6xqMQ 226 8,80214
7CMMe 232 7,81971
7IMl1 239 9,20391
7JVCY 242 9,25858
7TZDy 224 9,06930
83iIf 233 7,79850
84E3j 221 9,73042
8fkHs 225 7,85252
8gv2k 231 7,75945
8nl6L 240 9,25858
8tGXY 249 7,82996
8vKeE 222 8,80130
AjkK 233 9,62708
Bv7p 222 7,89681
D4KH 272 5,30759
EwIY 223 9,21983
Jjhv 227 9,85975
SFo0 225 9,32820
cWAv 206 5,45674
iqfB 231 9,09683
ixqg 252 9,31525
qLLy 242 8,90439
qM6s 230 6,60977
qRXA 226 5,07717
swWV 246 8,80537
xyx 232 6,16722
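The cluster summary (member count, average length, average pI) can be checked against the member table. The sketch below assumes each data row has exactly three whitespace-separated fields (protein ID, length, pI) and that pI uses a decimal comma, as in the rows above; the function names are hypothetical.

```python
def parse_decimal_comma(text: str) -> float:
    """Convert a decimal-comma number such as '5,41695' to a float."""
    return float(text.replace(",", "."))


def parse_member_rows(lines):
    """Parse 'ID length pI' rows; skip headers and malformed lines."""
    rows = []
    for line in lines:
        parts = line.split()
        # Data rows have three fields with an integer length in the middle.
        if len(parts) != 3 or not parts[1].isdigit():
            continue
        protein_id, length, pi = parts
        rows.append((protein_id, int(length), parse_decimal_comma(pi)))
    return rows


def summarize(rows):
    """Return (count, mean length, mean pI) for the parsed rows."""
    if not rows:
        raise ValueError("no rows to summarize")
    count = len(rows)
    mean_length = sum(row[1] for row in rows) / count
    mean_pi = sum(row[2] for row in rows) / count
    return count, mean_length, mean_pi


if __name__ == "__main__":
    sample = [
        "Protein ID Length (AA) pI",
        "4fGQC 220 5,41695",
        "11QQm 227 5,57780",
    ]
    print(summarize(parse_member_rows(sample)))
```

Run over all member rows, the result can be compared with the totals reported for the cluster. Whether the listed protein itself is counted among those rows is not stated on the page, so a difference of one in the count would not necessarily indicate an error.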
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_2880 (qLSY) 21 32,9% 173 1.293E-37
2 phalp2_17784 (97m5) 34 38,1% 194 4.743E-32
3 phalp2_27255 (3YXF6) 5 30,3% 168 3.887E-23
4 phalp2_34686 (2FGtp) 1 22,4% 183 2.094E-07
5 phalp2_12203 (6Ja4c) 19 25,3% 150 1.293E-05

Domains
Domain tracks: Unannotated; Disordered region
Representative sequence (used for alignment): 4fGQC (220 AA)
Member sequence: 1pFeP (240 AA)
Axis: positions 1 to 220 AA (representative)
Domain positions follow the representative sequence above; the member sequence bar is scaled to the same axis.
Legend: EAD, CBD, Linker, Disordered, Unannotated

Taxonomy

Phage
Name: Unknown from Metagenome
Taxonomy ID: UNKNOWN_ENVHOG
Lineage: No lineage information
Host
No host information

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (4fGQC) rather than this protein.
PDB ID
4fGQC
Method
AlphaFold v2
Resolution
77.92
Chain position
-
Model Confidence
Very high: pLDDT > 90
High: 90 > pLDDT > 70
Low: 70 > pLDDT > 50
Very low: pLDDT < 50
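The confidence legend maps a pLDDT score to one of four bands. A minimal sketch of that mapping follows; the legend does not say which band the exact boundary values 90, 70 and 50 belong to, so this sketch assumes they fall into the lower band. The function name `plddt_band` is hypothetical.

```python
def plddt_band(plddt: float) -> str:
    """Map a pLDDT score (0 to 100) to the confidence band in the legend.

    ASSUMPTION: scores exactly equal to 90, 70 or 50 fall into the lower
    band, because the legend uses strict inequalities on both sides.
    """
    if not 0 <= plddt <= 100:
        raise ValueError("pLDDT must be between 0 and 100")
    if plddt > 90:
        return "Very high"
    if plddt > 70:
        return "High"
    if plddt > 50:
        return "Low"
    return "Very low"


if __name__ == "__main__":
    for score in (95.0, 80.0, 60.0, 30.0):
        print(score, plddt_band(score))
```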