
We have covered de novo proteins (Part 2) and de novo small molecules (Part 3). Now we turn to biologics—specifically, therapeutic antibodies.
Antibodies are the fastest-growing and one of the most effective classes of drugs, valued for their exquisite specificity. But as any biologics team knows, they are "messy". They are massive, complex proteins.
The "Old Way" of discovering them is slow, expensive, and analog:
Both methods are resource-intensive. Worse, the "humanization" process—grafting the mouse-derived binding loops (CDRs) onto a human scaffold—is a "patch." It is a slow, error-prone process that often fails, resulting in candidates with high immunogenicity (the tendency to be rejected by the human immune system).
We skip this entire messy process. We design de novo antibodies in two stages, entirely in silico.
An antibody's binding specificity is defined by its six Complementarity-Determining Region (CDR) loops. The target's binding surface is the "epitope."
Using the 3D structure of our target epitope (the "lock"), we use our generative models (from Part 2, such as RFdiffusion or SOTA binder-design models like OriginFlow) to de novo design CDRs that are a perfect complementary match.
This single in-silico step allows us to generate a library of 10^6 or more antibody variants, all specifically "dreamed up" to bind the target epitope, often with predicted affinities higher than those found in nature.
This is the central problem in biologics. A perfect binder is not a drug. The primary reason biologics fail in development is "poor developability".
We define "Developability" as a set of critical, non-negotiable biophysical properties:
The "Old Way" is painfully sequential:
This multi-year, billion-dollar cycle is the bottleneck. Our platform shatters this with multi-objective co-generation.
Just as we did with small molecules in Part 3, we optimize for all properties simultaneously at the moment of creation. The "Old Way" is sequential filtering; our way is simultaneous, guided generation.
SOTA research explicitly describes a "guidance approach" that integrates this property information directly into the diffusion model's sampling process. This allows the simultaneous design of CDRs (for binding) "while considering critical properties like solubility and folding stability".
Our generative process is a multi-modal, guided process. As our diffusion model generates the antibody sequence and structure, it is guided at each step by a panel of predictive "developability" models.
Our multimodal "Developability Prompt" looks like this:
Generate(Antibody) WHERE reward =
(w1 * Binding_Affinity_Score) +
(w2 * SITA_Immunogenicity_Score) +
(w3 * SAbPred_TAP_Score) +
(w4 * CamSol_Solubility_Score)
We are not just designing binders; we are designing drug candidates. We optimize for low immunogenicity, high stability, and solubility from the first amino acid. This means the candidates that come out of our in-silico platform are already de-risked and optimized for manufacturing and clinical success.
We have now designed a novel protein (Part 2), a small molecule (Part 3), and an antibody. But these are all, for now, just hypotheses. How do we test them at scale?
Next in Part 5, we will show how we test all 1,000 designs in one day in our "Virtual Lab."
#antibodyDesign #biologics #developability #therapeuticAntibodies #CDRdesign

Ryan previously served as a PCI Professional Forensic Investigator (PFI) of record for 3 of the top 10 largest data breaches in history. With over two decades of experience in cybersecurity, digital forensics, and executive leadership, he has served Fortune 500 companies and government agencies worldwide.

Partner with the platform: Two tangible first projects to accelerate your R&D pipeline today.

The self-driving laboratory flywheel that connects AI theory to experimental facts through active learning.

Testing 1,000 drug candidates in one day using Digital Twin systems biology simulations and proteome-wide toxicity screening.