AlphaFold 3 and Protein Structure Prediction Explained

AlphaFold 3 and Protein Structure Prediction Explained

AlphaFold 3 and Protein Structure Prediction Explained

For fifty years, determining the three-dimensional shape of a protein required months of painstaking laboratory work. In 2024, DeepMind and Isomorphic Labs published AlphaFold 3 in Nature — a system that predicts not just protein shapes but the full molecular complexes that actually run biology: proteins docked with DNA, RNA, small-molecule drugs, metal ions, and cofactors, all at once. The AlphaFold 3 protein structure prediction engine is built on a generative diffusion architecture, a fundamental departure from the transformer-plus-frame approach that made AlphaFold 2 famous.

This matters because structural biology sits at the root of almost every drug discovery project. If you know the precise atomic arrangement of a target protein bound to a ligand, you can design molecules that fit it — or disrupt it — with far greater precision than blind screening allows.

What this post covers: the unsolved protein folding problem; why structure matters for function; how AlphaFold 2 worked and where it fell short; exactly what changed in AlphaFold 3 (Pairformer, diffusion module, unified biomolecule tokenisation); what the confidence scores actually mean; the access and openness debate; and — critically — the real limitations practitioners must understand before treating predictions as ground truth.


Why Protein Structure Is Still the Hardest Problem in Biology

Protein structure prediction is hard because the number of possible shapes a polypeptide chain can adopt is astronomically large, yet evolution has encoded the correct fold in the amino acid sequence alone. The physical answer exists, but finding it computationally was intractable for decades.

Proteins are polymers of amino acids. The linear sequence — the “primary structure” — folds through a hierarchy of intermediates: local repeating patterns (alpha helices and beta sheets form the secondary structure), the overall 3D fold of a single chain (tertiary structure), and the arrangement of multiple chains together (quaternary structure). Every level above the first is determined, in principle, by the sequence and by the physics of chemical bonds, electrostatics, and hydrophobicity.

The protein folding hierarchy: from amino acid sequence to 3D structure to molecular function

Figure 1: The folding hierarchy — a one-dimensional sequence encodes secondary motifs that pack into a tertiary 3D shape that directly determines biological function.

The challenge is scale. A protein of just 100 amino acids has more possible backbone conformations than there are atoms in the observable universe — Levinthal’s paradox. Evolution solves this by exploring a funnel-shaped energy landscape, not a flat search space. For decades, computational methods tried to approximate that energy minimisation but failed at realistic protein sizes.

The CASP (Critical Assessment of Protein Structure Prediction) benchmarks, held biannually since 1994, tracked incremental progress for twenty-six years. By CASP13 in 2018, the best methods barely crossed median template modelling scores (TM-scores) of 0.5 on hard targets. AlphaFold 2 entered CASP14 in 2020 and achieved TM-scores close to experimental accuracy on most targets, a result the organisers called “a stunning advance”. It effectively solved the single-chain protein structure prediction problem.

Understanding why AlphaFold 2 worked — and why it still left major problems open — is the necessary context for AlphaFold 3.

For related work on the experimental side of structural biology, see our explainer on cryo-EM reaching sub-ångström resolution, which shows how AI predictions are validated and refined with actual electron-microscopy data.

The AlphaFold Database and the Proteome-Scale Impact

Before AlphaFold, the Protein Data Bank held experimental structures for roughly 200,000 unique protein sequences — a small fraction of the hundreds of millions of sequences deposited in UniProt. The structural coverage of biology was lopsided: well-funded targets in cancer and infectious disease had multiple crystal structures at various states, while the vast majority of proteins — bacterial enzymes, plant signalling proteins, hypothetical ORFs in newly sequenced genomes — had no structural data at all.

AlphaFold 2 changed that at scale. DeepMind and EMBL-EBI jointly released the AlphaFold Protein Structure Database, which as of 2024 hosts predictions for more than 200 million protein sequences across hundreds of organisms. For the human proteome, virtually every protein with a known sequence now has a structural prediction with per-residue confidence annotations. For neglected tropical disease research, where funding for expensive experimental structure determination is scarce, AlphaFold predictions have opened structure-guided drug design for parasitic targets that previously had no structural data.

This is the genuinely transformative scientific contribution — not that each individual prediction is perfectly accurate, but that structural biology’s coverage of sequence space expanded by orders of magnitude in a single release. Biology moves at the pace of hypotheses, and structural hypotheses can now be formed for nearly any protein of interest. The downstream cost in experimental validation remains, but the bottleneck has shifted from “we have no structural starting point” to “we need to validate the structural model before committing resources”. That is a profound change in scientific workflow.

AlphaFold 3’s contribution to this trajectory is extending the same philosophy — broad, accessible coverage — from single proteins to biomolecular complexes. The AlphaFold Server allows researchers without computational infrastructure to submit custom complexes and receive predictions within hours. A laboratory studying a novel transcription factor can now get a predicted protein-DNA complex structure on the same day they identify a candidate binding site, rather than waiting months for collaborators with crystallography expertise.


How AlphaFold 2 Worked — and What It Couldn’t Do

AlphaFold 2 solved the single-protein folding problem by combining evolutionary information with a novel attention architecture called the Evoformer, but it was blind to the molecular context that actually matters in biology: everything that isn’t a protein.

The Evoformer and MSA-Based Reasoning

The key insight in AlphaFold 2 was that evolution is a natural experiment. If two positions in a protein co-evolve across thousands of organisms, they are likely in physical contact in the folded structure — because a mutation at one position is compensated by a mutation at the other to maintain the interface. Building a Multiple Sequence Alignment (MSA) of homologous proteins across the tree of life, and then running deep attention across the rows (sequences) and columns (positions) of that MSA, extracts this co-evolutionary signal.

The Evoformer is a stack of 48 transformer-style blocks that passes information between a sequence-level representation and a pairwise residue-distance representation simultaneously. After the Evoformer, a Structure Module used “invariant point attention” (IPA) to iteratively update rigid-body frames (each residue is a mini coordinate frame) until they converged on a consistent 3D arrangement.

The output confidence metric is pLDDT — predicted Local Distance Difference Test — a per-residue score from 0 to 100. Regions above 90 are highly confident and structurally reliable; regions below 50 are disordered or unreliable. The PAE (Predicted Aligned Error) matrix captures relative positional confidence between pairs of residues, which is especially useful for assessing domain-domain orientation.

The Limits AlphaFold 2 Left Open

AlphaFold 2 was trained and designed for single protein chains or simple homo-oligomers. It could not natively handle:

  • Small-molecule ligands (the drugs and cofactors that actually make proteins do things).
  • Nucleic acids (DNA and RNA, which proteins constantly regulate and are regulated by).
  • Metal ions and post-translational modifications.
  • Heteromeric complexes where the interaction surface is the clinically relevant part.

Practitioners began forcing AlphaFold 2 into these use cases by concatenating chains or stripping ligands — but these workarounds introduced systematic errors. The system had no learned representation for non-protein chemistry. AlphaFold 3 was designed from first principles to address exactly this gap.

A subtler limitation of AlphaFold 2 was its static, single-structure output. Proteins are not rigid objects. Enzymes change shape during their catalytic cycle; kinases toggle between active and inactive conformations; allosteric proteins shift geometry when a distant binding event propagates a structural signal across the chain. AlphaFold 2 predicts a single ground-state structure — typically the lowest-free-energy conformation in the training distribution. For a kinase in drug discovery, this might be the inactive DFG-out conformation when the relevant therapeutic target is the active DFG-in state, or vice versa.

The conformational sampling problem was, and still is, a fundamental gap. Methods like RoseTTAFold All-Atom (from David Baker’s lab at the University of Washington) pursued similar unified atom-level approaches in parallel with AlphaFold 3, and the field is now converging on hybrid strategies that combine deep-learning structure prediction with explicit molecular dynamics to recover conformational diversity. AlphaFold 3’s diffusion module is a step toward this — the stochastic sampling means different seeds produce structurally distinct outputs — but it is not a full ensemble generator by design.


AlphaFold 3: Unified Molecular Complex Prediction

AlphaFold 3 protein structure prediction is architecturally distinct from its predecessor in three ways: it replaces the Evoformer with a lighter Pairformer; it adds a generative diffusion module that operates on raw atomic coordinates; and it tokenises all molecular entities — proteins, nucleic acids, ligands, ions — through a unified scheme so the model learns a single representation of molecular geometry.

AlphaFold 2 vs AlphaFold 3 pipeline: Evoformer and structure module vs Pairformer and diffusion

Figure 2: The pipeline shift from AlphaFold 2 (MSA-heavy Evoformer feeding a frame-based structure module) to AlphaFold 3 (lighter Pairformer feeding a diffusion module that generates atomic coordinates for the entire complex).

The Pairformer Replaces the Evoformer

The Evoformer’s MSA processing was computationally intensive and structurally tied to protein sequences. AlphaFold 3 retains the pairwise distance representation but strips out the deep MSA processing. The Pairformer still runs attention over the pair representation and a single sequence representation, but with a lighter MSA module feeding in summarised evolutionary features rather than a full per-residue MSA stack.

This is not a pure regression — the MSA signal still enters the model, but it is aggregated earlier and does not dominate the computation. The freed capacity goes toward learning representations that span all molecular entity types. Each atom in every molecule, regardless of type, is embedded into a shared token space before the Pairformer sees it.

The Diffusion Module: Generating Atomic Coordinates

The most significant architectural innovation is the diffusion module. Rather than building a 3D structure by updating rigid frames (AlphaFold 2’s approach), AlphaFold 3 samples atomic coordinates using a denoising diffusion process.

Diffusion models, popularised in image generation (Stable Diffusion, DALL-E), work by learning to reverse a noise-addition process. Training involves corrupting real data with Gaussian noise across a series of timesteps, then training the model to recover the original. At inference, the model starts from noise and iteratively denoises toward a plausible structure.

In AlphaFold 3, the diffusion module conditions its denoising on the pair and single representations produced by the Pairformer. It generates full all-atom coordinates — not backbone frames alone — in a continuous atomic coordinate space. This is what allows ligands (which have no concept of backbone frames) to be treated natively.

The paper by Abramson et al. (2024) in Nature describes the model training on the Protein Data Bank (PDB) and additional datasets covering nucleic acid structures and ligand complexes (doi: 10.1038/s41586-024-07487-w).

Unified Input Tokenisation

The third architectural pillar is the input representation. AlphaFold 3 tokenises:

  • Proteins as residue-level tokens with standard amino acid identity and MSA-derived features.
  • DNA and RNA as nucleotide-level tokens with analogous sequence features.
  • Small molecules and ions at the heavy-atom level, using a graph-based atom representation derived from the 2D chemical structure (SMILES or CCD code).
  • Covalent modifications (glycans, phosphorylation) by specifying modified residue identity.

All of these tokens enter a shared Input Embedder before being passed to the Pairformer. The model learns pairwise interactions between any two atoms in the complex — a protein residue and a ligand atom, a DNA base and a protein side-chain — within the same attention mechanism.

AlphaFold 3 inputs, model components, and outputs for full biomolecular complex prediction

Figure 3: The AlphaFold 3 system accepts proteins, nucleic acids, small molecules, and ions through a unified tokeniser; the Pairformer encodes pairwise geometry; and the diffusion module generates full all-atom coordinates with per-residue confidence estimates.


What AlphaFold 3 Can Actually Predict — and the Confidence Scores That Tell You When to Trust It

AlphaFold 3 can generate structural predictions for the following complex types that were previously out of reach: protein-DNA complexes (relevant for transcription factor binding, CRISPR guide design); protein-RNA complexes (splicing, translation, riboswitches); protein-small molecule complexes (drug discovery, enzyme catalysis); and antibody-antigen structures. For single proteins and standard protein-protein complexes, it achieves accuracy comparable to AlphaFold 2.

The confidence outputs matter as much as the coordinates:

  • pLDDT (per residue, 0–100): A direct measure of local structural confidence. Regions above 90 are treated as reliably ordered; 70–90 is generally good; below 50 should not be interpreted as a defined structure. Intrinsically disordered regions will consistently score low and will differ across multiple runs.
  • PAE (Predicted Aligned Error, per residue pair): Estimates the expected positional error of residue j given that residue i is correctly placed. Low PAE across a domain-domain interface means the relative orientation of those two domains is confidently predicted; high PAE means the arrangement could be wrong even if both domains are individually well-predicted. This is the most critical metric for assessing binding-site predictions.
  • pTM (predicted TM-score): A single summary score for the whole complex. Useful for quick filtering but less diagnostic than PAE for binding interfaces.

The AlphaFold Protein Structure Database (alphafold.ebi.ac.uk), maintained by EMBL-EBI, hosts over 200 million pre-computed predictions from AlphaFold 2. AlphaFold 3 structures are accessible via the AlphaFold Server (alphafoldserver.com), which allows users to submit custom complexes up to a specified chain count for non-commercial research.


AlphaFold 3 for Drug Discovery: The Workflow

AlphaFold 3 does not discover drugs. It provides structural hypotheses that guide and accelerate the early stages of a discovery pipeline. The diagram below shows where it fits — and the crucial wet-lab validation steps that must follow.

From AlphaFold 3 predicted structure to drug discovery: binding site analysis, virtual screening, and wet lab validation

Figure 4: A realistic drug discovery workflow using AlphaFold 3 predictions — confidence filtering gates the analysis, binding site identification feeds docking, and every computational result requires experimental validation before advancing.

The key contribution is enabling structure-based drug design (SBDD) for targets whose structures were previously unknown or hard to obtain. For a novel bacterial kinase with no close homologue in the PDB, or a membrane protein that resists crystallisation, an AlphaFold 3 prediction can unblock weeks of work.

Isomorphic Labs, DeepMind’s drug discovery subsidiary, is already using AlphaFold 3 in active discovery programmes. They have reported partnerships with large pharmaceutical companies (AstraZeneca, Eli Lilly) specifically citing AI-guided structure-based design. Publicly disclosed details remain limited, but the collaborations are structured around accelerating hit-to-lead campaigns using AI structural predictions — not replacing experimental validation, but compressing the timeline before costly biology investments are made.

For a sense of how structural predictions intersect with graph-based reasoning in other scientific domains, see GraphRAG and hybrid retrieval with knowledge graphs, which covers the same pattern of structured AI representations unlocking downstream analytical pipelines.

The drug discovery workflow for an AlphaFold 3 prediction typically proceeds as:

  1. Submit the target + known ligand (or apo form) to the AlphaFold Server or an on-premise deployment.
  2. Filter by confidence: reject or flag binding-site residues with pLDDT below 70 or high PAE at the interface. These regions need experimental structure before committing to SBDD.
  3. Binding site identification: use structural analysis tools (fpocket, DoGSiteScorer) to enumerate pockets in the predicted structure.
  4. Virtual screening: dock compound libraries (ZINC, Enamine) into high-confidence pockets using AutoDock-Vina or Glide.
  5. Structure-based design iterations: use the predicted complex to guide medicinal chemistry intuition — which vectors project into solvent, where a hydrogen bond donor could be added.
  6. Molecular dynamics: short MD simulations (tens to hundreds of nanoseconds) probe whether the predicted binding pose is stable, since AlphaFold 3 gives a single static snapshot.
  7. Wet lab validation: cryo-EM or X-ray crystallography confirms the binding mode; binding assays (SPR, ITC, or fluorescence-based) measure affinity. No compound advances on prediction alone.

The AlphaFold 3 prediction is most valuable at steps 1–5. It becomes less reliable when the target is intrinsically disordered (step 2 filters these out), when the binding pocket only forms upon ligand binding (induced-fit effects AlphaFold 3 does not explicitly model), or when the ligand is large and flexible.

Beyond Pharma: Structural Biology Research and Synthetic Biology

The impact of AlphaFold 3 extends well beyond pharmaceutical drug discovery. In basic research, the ability to predict protein-DNA complexes is reshaping how transcription factor biology is studied. Researchers investigating gene regulation in non-model organisms — where genetic tools are limited and experimental structure is rarely available — can now form structural hypotheses about which transcription factors bind which promoter sequences, based on predicted complex geometry. This has direct implications for synthetic biology, where designing new genetic circuits requires understanding protein-DNA recognition specificity at the structural level.

For antibody engineering, AlphaFold 3’s ability to predict antibody-antigen complexes provides a faster route from antigen sequence to structural understanding of the epitope. Antibody discovery campaigns traditionally required lengthy crystallisation of the Fab-antigen complex to understand binding geometry. Predicted structures do not replace this, but they accelerate early campaign decisions: which antibody candidates are likely binding overlapping epitopes, which framework regions are likely to create expression problems, where to introduce mutations to optimise affinity.

In structural genomics — the large-scale effort to systematically determine structures across entire proteomes — AlphaFold has partially displaced the need for brute-force experimental coverage. Resources can now be directed at the most uncertain regions: low-confidence AlphaFold predictions for novel folds, protein families with no structural representatives, and complex assemblies whose interfaces are biologically important. The prediction is the null hypothesis; experimental work either confirms it or reveals something genuinely new.


Trade-offs, Gotchas, and What Goes Wrong

AlphaFold 3 is a remarkable tool with real, non-trivial limitations that anyone using it for research or drug discovery must internalise. Treating predictions as ground truth is a category error that can waste months of downstream work.

Confidence is not accuracy for dynamics. A high pLDDT score means the model is confident about a static average structure. It says nothing about how much the protein moves in solution, how it transitions between functional states, or which conformation is relevant at physiological conditions. Proteins like GPCRs or intrinsically disordered proteins (IDPs) are defined by their conformational ensembles, not a single structure. AlphaFold 3 produces one answer; the right answer is a distribution.

Diffusion hallucination is real. Diffusion models are generative models. Given an underspecified or novel query — a sequence with no close homologues, a ligand class not well represented in training data — the model can generate a structurally plausible-looking output that is physically wrong. The pLDDT scores can be high for hallucinated regions because confidence is calibrated on training-distribution examples. Running multiple predictions and checking inter-run consistency is good practice for low-homology targets.

Disordered regions are systematically unreliable. Roughly 30–40% of human proteins contain significant intrinsically disordered regions (IDRs). These regions score low in pLDDT by design — the model has learned they are not stably folded. But this means AlphaFold 3 provides no useful structural information for a large fraction of the proteome that is functionally important (transcription factor activation domains, liquid-liquid phase separation regions, regulatory hubs).

Ligand geometry errors compound. The diffusion approach learns ligand geometry from the PDB, which has known biases toward drug-like small molecules. For unusual scaffolds, metalloorganic compounds, or covalent warheads, the predicted binding pose can have incorrect bond angles or ring conformations. Always validate predicted ligand geometry against basic energy minimisation before using it in docking.

No kinetics, no thermodynamics. AlphaFold 3 predicts structure. It provides no information about binding affinity (Kd), association/dissociation rates, or thermodynamic stability. Structural similarity between predicted and experimental poses does not correlate linearly with binding free energy — free energy perturbation methods or experimental assays are required for affinity information.

The openness gap. AlphaFold 2 was fully open-sourced under a permissive Apache 2.0 licence and became the foundation for hundreds of derivative tools. AlphaFold 3’s model weights are available for non-commercial academic research, but the licence prohibits commercial use and redistribution without DeepMind’s permission. The server provides a black-box API for commercial users. This creates a reproducibility constraint: a result generated via the AlphaFold Server cannot be independently replicated by someone who does not have server access, and the exact random seed matters for diffusion outputs.

Training data distribution effects. AlphaFold 3, like all deep-learning models, is bounded by what was in its training data. The PDB, while the largest curated repository of experimental protein structures, is heavily biased toward soluble, well-behaved proteins that yield to crystallisation or cryo-EM. Membrane proteins, which constitute roughly 25–30% of the human proteome and represent a disproportionately large fraction of drug targets (particularly GPCRs, ion channels, and transporters), are underrepresented. Predictions for these classes, while often better than nothing, carry heightened uncertainty that the confidence scores may not fully capture.

Stoichiometry and assembly errors. When submitting multi-chain complexes, AlphaFold 3 requires the user to specify the exact stoichiometry. If the biologically relevant assembly is a trimer but the user submits a dimer, the prediction is structurally plausible as a dimer but biologically wrong. The model does not infer stoichiometry from sequence alone. Experimental data on the oligomeric state — native PAGE, analytical ultracentrifugation, or size-exclusion chromatography coupled to multi-angle light scattering (SEC-MALS) — should always be gathered before committing to a complex prediction.


Practical Recommendations

For researchers and discovery teams integrating AlphaFold 3 into their workflows, these principles reduce the risk of drawing wrong conclusions from predictions.

Always examine confidence before examining structure. Load the pLDDT and PAE outputs before you look at the 3D coordinates. If the interface of interest has high PAE, the binding-site arrangement could be an artefact. Invest in experimental structure first for those regions.

Run multiple predictions. The diffusion module introduces stochasticity. Running five to ten predictions with different seeds and clustering the results gives a rough sense of conformational variability. Regions with consistent structure across runs are more reliable; regions with high structural variance across runs indicate local disorder or model uncertainty.

Treat ligand poses as starting geometries, not final answers. Use AlphaFold 3 predicted complexes as input to a docking or MD pipeline, not as the final binding mode. Even correct pocket identification with an incorrect pose orientation can mislead structure-based drug design.

Combine with experimental cryo-EM where stakes are high. Modern cryo-EM can resolve structures at near-atomic resolution (see cryo-EM reaching sub-ångström resolution for context on the current state of experimental validation). For a high-priority clinical target, AlphaFold 3 narrows the search space; cryo-EM or crystallography provides the data that justifies advancing a compound.

Keep the openness constraints in mind when publishing. If your research relies on a specific server-generated prediction, archive the output and document the input exactly. If reproducibility for peer review requires that a reader can regenerate your prediction, confirm whether the server API provides that.


Frequently Asked Questions

Is AlphaFold 3 accurate enough to replace experimental structure determination?

No — not as a replacement, but increasingly as a first screen. AlphaFold 3 protein structure prediction can produce near-experimental accuracy for well-represented protein families, but disordered regions, novel ligand scaffolds, and complex conformational dynamics still require experimental data. The appropriate framing is that AlphaFold 3 reduces the number of experimental structures needed by prioritising which targets warrant costly crystallisation or cryo-EM campaigns.

What is the difference between pLDDT and PAE in AlphaFold 3 outputs?

pLDDT scores each residue independently — it estimates how well the local geometry around that residue matches what a real folded protein would look like. PAE (Predicted Aligned Error) assesses relative confidence between pairs of residues: it answers whether residue A and residue B are in the right position relative to each other, even if both are individually well-folded. For drug discovery, PAE at the binding interface is the more critical metric than pLDDT alone.

Can AlphaFold 3 predict how a protein binds a small-molecule drug?

Yes, this is one of AlphaFold 3’s core additions over AlphaFold 2. Given a protein sequence and a ligand specified by CCD code or SMILES, the model predicts the bound complex. However, the predicted binding pose should be treated as a hypothesis for further validation. The model can mis-predict pocket geometry for rare chemotypes, and it does not model induced-fit binding where the protein changes shape upon ligand binding. Use the prediction as a starting point for docking and molecular dynamics simulations.

What does “diffusion model protein folding” mean in practice?

Diffusion models generate data by learning to reverse a noise process. During training, AlphaFold 3 sees real protein structures with Gaussian noise progressively added to atomic coordinates, and learns to denoise them. At prediction time, it starts from random noise and iteratively refines toward a plausible structure, conditioned on sequence and pairwise-distance information from the Pairformer. This is fundamentally different from the frame-based prediction in AlphaFold 2 and is what enables the model to generate all-atom coordinates for arbitrary molecular entities including non-protein chemistry such as ligands, DNA, and RNA.

Is AlphaFold 3 available for free?

For non-commercial academic research, yes — via the AlphaFold Server at alphafoldserver.com. The model weights are also available under a restricted non-commercial licence for academic use. Commercial use requires a separate licence from Google DeepMind and Isomorphic Labs. This is a significant change from AlphaFold 2, whose Apache 2.0 open-source release enabled widespread derivative tool development by the broader research community.

How does AlphaFold 3 handle protein complexes with multiple chains?

AlphaFold 3 natively supports multi-chain inputs. All chains — protein, DNA, RNA — and all small molecules and ions are submitted simultaneously and co-embedded by the unified tokeniser. The Pairformer learns cross-chain pairwise interactions, and the diffusion module generates coordinates for all entities jointly. This is why the model can predict heterodimer interfaces, antibody-antigen complexes, and transcription factor-DNA binding modes that were not accessible to AlphaFold 2 without significant workarounds.


Further Reading


By Riju — about.

Comments

No comments yet. Why don’t you start the discussion?

Leave a Reply

Your email address will not be published. Required fields are marked *