Protein

Protein accession
4IlWv [EnVhog]
Representative
7uctk
Source
EnVhog (cluster: phalp2_24844)
Protein name
4IlWv
Lysin probability
88%
PhaLP type
VAL
Probability: 99% (predicted by ML model)
Protein sequence
MATINGDLQRHTNLLGGIPNRWKMVGAAVASVLPALIPLSAAALQTAGAFTTMSVAVGAAVGAYVAALKGAISKTMEMAKAGKALSPVQKEFVSSVNGMKSAWAQFISSTQNKTLATATNVVQGLTSGIKLLKPLVDAIHPSLLQISKDFEAWTKGDSARNYSNLLSAIAKTVMPSLSQAGKDVLNVLGDGFRAFAPLTKGMAESIAAGAAALKSWSDGGGFQRFLTYVRDNAPAVQAFWKAFREALANVFNTLHQFSGGVLHVLTDSLRAIAAINPESVKAFANAMLLLRAPILFLVLNCPPLRDAMIALLGAMNPTIIYAVAAAFIALRIAFMIMNTALLTTPLGWIIMGLVALGVAIAIIATKTTWFQTAWTYTWNFIKTVFSTVYTFLTTGLGQLVLLILGPVGALLILLNNWSTVWNAIKAFGSMIWQALQTAWHATINAIVTAWNAVAGALSAAWNAVWNALKTAAQFVWTALQLAWSLFITGLQAIWTAVSGALSTAWNAVWNAIKTAAQFVWTSLQLAWSLFITGLQAIWSTVSGALSAAWNAVWNVIKGAAMAVWAFLQTAWHNFIVGVQVIWSTVSSALSSAWNTVWNAIKGAAMAVWAFLQTAWNTFLTAVRTAYNTFASAVTGSWNLFWNAIKTAAQTVWNWLKTAWQALVDAVRTVYNTFASALKGSWDLFWNAIKNAAKAVWDWIKEKWEQLCTGVRVIYNDFSQKLQDSWDLLWKTLKDGAHKWWELIKKVIADAINWLLKPINLLIKGFNNIASALNMDISIPEIKVEFANGGMVGGVPHFAAGGVANFSNGGAVPGYAPGRDTVPAVLSRGEGVLVPEAVRGLGGANFVHGANHYFSKGRAPRRGSRWSPGGRGCGPQGFADGGMVGGVQHFAVGGMTAAALAKAGVSLGMVSQGEYSDGSLSGGTHLGGGAVDISSTSPAVLAALKAAGFAAWIRGPAEGMSPHIHAVLMGHPELSAAAQAQVASFLAGGSGLGVGGGGGGGFSPIDFIKNMFGASVGEILRKIAKFGGSLADYMTGGGGGDDDGGGGGIFDSVVGTIADIANPFNDDNPVADKAAEVAGGAMSGIVGMFLAAIDRSAVEMAFDWIGFDDSPAGMMGDMMAEMGSQSIEGAIDWVTNKAEEMAPDMSAMMGGGDVSTWAPLAAMALAMAGISANQLPAFLALMAAESGGNPLAVNNWDSNAAMGQASQGLMQVIPSTFAAYRDPSLPNNILDPLANMVASARYIRAVYGGLVPGSPYANGTPGATRGVHLVGEQGPELMNFAGGETVTPAKETAGIMGGGAGGALPEDMMDRFKLLMTLSQQMATTVQTSWGTVQASGRTAQSALAPVFSDIATQIGTDIPTAMATMSAANTACWAAMQTGTTTAWTAMRDGTFLEANDHMAVKMPQWGGEMCAAVSEAWVNMGDATATAWEAMKEGTRGPTNWIIENSYNDGIARLWNEVAAVIFEDGARTLPSVATLAKGGPVHGPGGERSDRVPAMLSRGEHVWTAREVHRAGGHHAVQAMRSGVMGGRDVRGASRGCGFALGGWSDPGGLGLPSLGELKRGSMMDEAEPFLDALDSGVKASLATSPYRRMGGSAGALSPSDWMRQYIKKDDELNKLGSGGPPWIPGVSDAITSWGGVTVNQRTADMLNMAIALGASFSATQGSFSTGVAASAGTHDGGGVVDLVPTGDGNVGALRAVGFAAWNRGAAYGSPSFSDHIHAVALGDPTVSPGAAAQVMSYLAGGNGLADGGPDNFSGGIPGGASGVLPAGGGREVDPADLEQRTVSFDRGGLLQPGYTLAYNGTGKPEPVGHNLEPRGSRGDVHIEMPITISGNTDAQAVVDGINNQVLPKLRQYLNQGTGTNC
Physico‐chemical
properties
protein length:1864 AA
molecular weight:193933,2 Da
isoelectric point:6,41
hydropathy:0,20
Representative Protein Details
Accession
7uctk
Protein name
7uctk
Sequence length
2094 AA
Molecular weight
217699,48170 Da
Isoelectric point
9,24975
Sequence
MATVTSLGFSIFSRYDGNGVSQARRDIAGFRNELVAADRSIVSATRRFQGLQVAAVAIAPALLPVATVAAANAGALAAMGVSAGTALGVFGAAMGGAIKNTLSLRDSVQQLKANLDQQRATLATLEPGTEAYAKQLLKVLQAQQEYDAALRRMTPAQRAFLQSLDALTASWQRFISQTQNQSLGVATTVLAAMAVAVGKLKPLFDAVVPSMQGVANAIAAWLKGDGFERFIQVVMTQGVPSLNALIAAGRSMATVLGEAFRAFAPLGTELANSLARGAAELAKWSTEGGFIRFIREARAQTPAVREMLDALWAAFKNILAAMQQLAPVSMTLITVLAQIVAAIPPGVIAAIAQAFVAWRVAILGMMVIQGVAALFTAFVHVLSLLRGAVMVAITVWRAFNLAFIATPIGAVITAIVALVAAFVILWNKCEWFREFWINWWNNIKQEASTAWQWIQMAWNAVGSAMVTAWQAVSGALVTAWNATWSGISTAVQAIWTGLTTAWNAVVNGFQVVWSAVGGALSAAWSAVWGAIQVAATAVWNALQTGWQVFVGTLQTIWSAVGGALSAAWSAVWNAISVAATAIWSALQAAWSAFITGLQTIWSTVSGALSAAWSAVWNAISAAAMAVWNALQTAWSAFITGLQTIWSTVSGALSATWSAVWNAISAAATTIWSALQAAWSAFVSALQTVFSAVSGALSAAWSAFWSGVQSAAQTVWSALQAAWQAFLSALQSAYQTISSALSAAWSAFWSGLQTAAQAVWSAIQTAWSAVLTAMQTAWSAFSSALTAAWNAFWTAVRTAAQAVWTALQTAWSAFLTAIQTAYTTFSAALTAAWNLFWNAIRTAAQTIWTALQTAWTAFLNLLRTAYNTWSAALTAAWNAFWTALRAAAQAIWNALSASWQALLNLLRNTYNTWSAALRAAWQAFWTAVRTAAQTIWNAMSASWQALLTALRNAWNTFSSAIRTAWNATWNALRDIARTIWNQIGGIIERAINGVIGIVNALIKGFNNVTSFLSIDVKIGEIGTVNFPTLATGGIVTFAYGGMTGKPCPEMQGYAAGGPVNLRKGGTLRGYAPGKDTVPAILSKGEGVLTPEAVRGLGGPGFVNGANRKFAGHRGAGRGAPSLDKFGVPHFAVGGMTSAALARAGVPMGLISQGEYSHGSLSAGTHAGGGAVDISSTSPAMLARLHAAGFAAWIRGPEHGMSPHIHAVLMNHPELSGPARAQVASFRAGGSGLGSGGGGAGGGGGIPSFLQSILSKAGEILSRVAQGLPLGSLLDGLSGLFGGGGGGGGSEEKEDGGGLFGSGIGPDFGPDITPGKDLADAASKVIGGAAKVAGGVAGALLPDLGSIGKMLLGLIPDDAFEFAFKWVKDRLSGWGNPAGNFGKILIAMGEKVIKGAIDFLIGKNKEAEASAMAQFASFSVAGAQSVQSWAPLARQAMVMGGLDPSQLPAFLQRMQIESTGNPNAINNWDINARNGIPSQGLMQIIPPNFQKYHVPGTSNNILDPLANMAAAAAYIKDRYGGRVPTGKAYALGTPGALPGPALVGENGPEIVNMRGGDTVTPAKDTAQILSNAAAPALPPVPATGMPQTTPLQSPIIDQLKAAPEGEGWFGDIIAAAQHMLATAQTAWNGTVQAGVTATPAITTTTDLVGKKVGADIPAKLDLMAGSSQANWSAMNASALTNWNAMLATVFTPAEQHQGTTMPLTATQMNTASNLAWTNMNAQSAAQWVLMRDSTFTEAELHQGTTMPTMATTMQTASDTAWTTMNATSAEQWTGIRDGQVVPFETHMQTTMPEAATAMNEAVGAAFTAMVETIVAQLDTAIAKIEEFIAATEAAIAAAEALAAAQAAAASSGAGGSLGAANGSAAAALSAAGISSGMIVQGPYSNSVAASAGTHSGGGVYDIAGGPELLPALHAAGFAAWYRDWPGNQHIHAVYSGASDLSPQAQWQLDDFRRGGDGLGIPGGLASGTSGASRGWSWIGERGPELMKLRGGERIKSNKWARRYTAGAQRGTARLLDSFRPADFASRFMDSVDVSVCRTERSDSRPIPVVRSDGGETTITIPITVQGNLDHEAVREIENEVIPKLRSLMQQGVGKGR
Other Proteins in cluster: phalp2_24844
Total (incl. this protein): 18 Avg length: 1656,5 Avg pI: 8,78

Protein ID Length (AA) pI
7uctk 2094 9,24975
4XR2o 2544 6,18001
4XRC6 2544 6,17905
4kQXo 1769 6,59147
6TJg3 2047 5,62293
7uix1 1736 9,89585
e7eq 1567 7,64384
A0A2Z4QB97 1365 9,99880
A0A2Z4QBH6 1365 9,99139
A0A2Z4QC02 1366 10,04529
A0A2Z4QC25 1366 10,12652
A0A345L2B1 1366 10,00731
A0A386KQX1 1366 10,02917
A0A5J6TDR9 1365 9,99880
A0A5J6TDY2 1366 10,04413
A0A9E7NGG3 1366 10,05173
A0A9E7NHP4 1361 9,95187
Similar Clusters (pHMM search)
# Cluster # Members Identity (%) Alignment Length E-value
1 phalp2_5099
1ICO0
13 29,0% 1640 5.266E-188
2 phalp2_7749
6XVyZ
2 21,5% 1525 8.368E-92
3 phalp2_23798
1dPcB
56 20,5% 1504 1.040E-63

Domains

Domains

No domain annotations available.

Taxonomy

  Name Taxonomy ID Lineage
Phage Unknown from Metagenome
[NCBI]
UNKNOWN_ENVHOG No lineage information
Host No host information

Coding sequence (CDS)

Coding sequence (CDS)

No CDS data available.

Gene Ontology

No Gene Ontology terms available.

Enzymatic activity

No enzymatic activity data available.

Tertiary structure

No tertiary structures available for this protein.

The structures below correspond to the cluster representative (7uctk) rather than this protein.
PDB ID
7uctk
Method AlphaFoldv2
Resolution 47.53
Chain position -
Model Confidence
Very high
pLDDT > 90
High
90 > pLDDT > 70
Low
70 > pLDDT > 50
Very low
pLDDT < 50