How to Engineer Proteins

Simplify Complexity: One-Step Protein Evolution

No more blind trial-and-error in the lab.MatwingsVenus™ acts like a "virtual laboratory," simulating thousands of evolutionary possibilities in advance and delivering only the optimal solutions.

Based on built-in tools, the assistant can predict the effects of single or multiple amino acid mutations on protein structure and function. It supports single-point mutation prediction using state-of-the-art AI models, and can also perform training-based predictions using user-provided wet-lab data.

engineer

1. Case Study (How to Upgrade Your Protein?)

1.1 Follow the Prompted Dialogue

1.1.1 Input Requirement

We aim to enhance the catalytic activity of UniProt ID P00698 (Lysozyme) toward its substrate peptidoglycan.

Reference Link

屏幕截图 2026-04-24 095257

The assistant outlines required additional inputs and describes the workflow.

微信图片_20260427140642_41_25

1.1.2 Additional Information

Obtain the highest-quality structural model available. Apply multi-model cross-validation. First analyze active sites and highly conserved residues. I can process 50–200 samples per run.

屏幕截图 2026-04-24 095451

Once complete information is provided, the assistant begins execution.

PixPin_2026-04-26_23-53-18

After processing, the assistant successfully completes the task.

屏幕截图 2026-04-24 101816

You can proceed with further steps based on the assistant’s suggestions.

屏幕截图 2026-04-24 102005

1.2 Direct Instruction Input

Provide all requirements at once, and the assistant will directly perform prediction and output results.

“We aim to enhance the catalytic activity of UniProt ID P00698 (Lysozyme) toward peptidoglycan. Obtain the best structural model available. Apply multi-model cross-validation. Analyze active sites and conserved residues first. I can process 50–200 samples per run. No restricted mutation sites. Directly output final results.”

Reference Link

微信图片_20260427142234_44_25

2. What Can MatwingsVenus™ Do?

  1. Protein structure prediction and visualization (3D structure generation):

    1. ESMFold (Fast Mode): Fast with moderate accuracy (≤500 amino acids), charged per run

    2. AlphaFold (High-Accuracy Mode): Slower but highly accurate (≤2000 amino acids), charged by runtime

  2. Functional site and risk analysis: identify active sites, conserved residues, and binding sites

  3. Single-point mutation prediction: perform saturation mutagenesis prediction using AI models

  4. Iterative optimization: incorporate wet-lab data for more accurate multi-site mutation prediction


3. Input Tips (How to Improve Accuracy?)

3.1 Provide Detailed Protein Information

  • Upload protein 3D structure files (PDB format)

  • Input UniProt ID or amino acid sequence

  • Use built-in large-scale protein (metagenomic) datasets

engineer-3.1-en

3.2 Specify Clear Engineering Goals

  • Improve thermostability

  • Enhance catalytic efficiency or other properties

  • For enzymes, specify substrates and binding sites

3.3 Provide Experimental Throughput

The system will return a corresponding number of predictions based on your throughput capacity.

3.4 Upload Wet-Lab Data for Higher Accuracy

engineer-3.4-en

3.5 Choose Multi-Model Cross-Validation

  • Enabled by default for higher accuracy

  • Optional single-model mode to reduce credit consumption

  • You may specify models based on Section 5

3.6 Specify Constraints in Advance

You may also provide initial requirements, and the assistant will guide you on additional inputs needed.


4. Mode Selection

The platform provides three working modes:

  • Fast Mode: Lightweight intelligent retrieval with efficient output

  • Thinking Mode: Handles complex tasks with deeper reasoning and integration

  • Thinking Mode Pro: Advanced reasoning and cross-domain problem solving


5. Model Overview

5.1 Structure-Based Models (Higher Performance)

  1. VenusREM SeriesZero-shot mutation prediction. Predict mutation effects without experimental data using homologous structural retrieval. [Paper] [Code]

  2. VenusPrime SeriesFew-shot mutation prediction. Fine-tune with small experimental datasets for improved accuracy. [Paper] [Code]

  3. VenusX SeriesProtein site prediction. Identify active, binding, and functional residues for rational design. [Paper] [Code]


Platform design and commercial copyright belong to Tianwu