agedi.api¶

Public API for AGeDi.

Re-exports all public symbols from the api sub-modules so that from agedi.api import X works for every user-facing name.

Submodules¶

Functions¶

`register_model`(→ None)	Register a custom score model backbone factory under name.
`create_dataset`(→ agedi.data.Dataset)	Create and setup an AGeDi Dataset from ASE Atoms objects.
`create_diffusion`(, sde, SDE, None] = None, ...)	Create a diffusion model for script-based training and sampling.
`load_diffusion`(→ Agedi)	Load a trained diffusion model from an AGeDi log directory.
`predict`(→ List[ase.Atoms])	Predict energies and forces for input structures using a trained force-field.
`sample`(→ Union[List[agedi.data.AtomsGraph], ...)	Sample structures from a trained diffusion model.
`create_trainer`(→ lightning.Trainer)	Create a Lightning trainer configured for AGeDi.
`train`(→ lightning.Trainer)	Train a diffusion model and return the trainer used.
`train_from_atoms`(, sde, SDE, None] = None, ...)	Build (or restore), train, and return an AGeDi model from ASE Atoms data.
`train_from_config`(→ Tuple[Agedi, agedi.data.Dataset, ...)	Train an AGeDi model from a YAML configuration file or dictionary.

Package Contents¶

agedi.api.register_model(name: str, factory: Callable) → None¶

The factory is called with the keyword arguments cutoff, heads, feature_size, n_blocks, head_dim, and n_rbf and must return a 3-tuple (translator, representation, List[Head]).

Registered models can be selected by passing model=name to create_diffusion().

Parameters:

name (str) – Alias used to select this backend (e.g. "PaiNN").

factory (Callable) –

Factory function with signature:

factory(cutoff, heads, feature_size, n_blocks, head_dim, n_rbf)
    -> Tuple[Translator, nn.Module, List[Head]]

Examples

from agedi.functional import register_model

def my_factory(cutoff, heads, feature_size, n_blocks, head_dim, n_rbf):
    ...
    return translator, representation, head_list

register_model("MyModel", my_factory)

agedi.api.create_dataset(data: Sequence[ase.Atoms], cutoff: float | None = None, batch_size: int = 64, train_split: float | int = 0.9, val_split: float | int = 0.1, mask: str = 'none', confinement: Tuple[float, float] | None = None, conditioning: str = 'none', conditioning_type: str = 'scalar', repeat: int | None = None, canonical_cell: bool = False, regressor_data: Sequence[ase.Atoms] | None = None, properties: List[Dict] | None = None, fully_connected: bool = False) → agedi.data.Dataset¶

Create and setup an AGeDi Dataset from ASE Atoms objects.

Parameters:

data (Sequence[Atoms]) – ASE Atoms objects to add to the dataset.
cutoff (float, optional) – Neighbour-list cutoff radius in Ångström.
batch_size (int, optional) – Mini-batch size used during training/validation.
train_split (Union[float, int], optional) – Fraction or absolute number of samples for the training split.
val_split (Union[float, int], optional) – Fraction or absolute number of samples for the validation split.
mask (str, optional) – Atom-mask method (e.g. "MaskFixed" or "none").
confinement (Tuple[float, float], optional) – Z-axis confinement bounds (z_min, z_max).
conditioning (str, optional) – Name of the per-structure property to use as a conditioning signal. The value is read from atoms.info[conditioning] or the corresponding atoms.get_<conditioning>() method. Ignored when set to "none" (default).
conditioning_type (str, optional) – "scalar" (default) or "node"; controls how the conditioning property is broadcast onto the graph.
repeat (int, optional) – When given, augment the dataset by repeating each structure up to repeat times along the first two cell vectors.
canonical_cell (bool, optional) – Store cells in canonical lower-triangular form.
regressor_data (Sequence[Atoms], optional) – Additional ASE Atoms objects used to train a regressor head.
properties (List[Dict], optional) – Per-structure property dictionaries; must contain exactly one entry per element in data. Each dictionary is merged into the corresponding graph object via setattr, matching the layout accepted by add_atoms_data(). Keys already produced by the conditioning logic are overwritten by values in properties when both are present.

Returns:

A fully set-up Dataset ready for training.

Return type:

Dataset

agedi.api.create_diffusion(model: str = 'PaiNN', cutoff: float | None = None, feature_size: int = 64, n_blocks: int = 4, n_rbf: int = 30, noisers: Sequence[str | Noiser] = ('CellPositions',), sde: str | SDE | None = None, conditioning: str = 'none', conditioning_type: str = 'scalar', confinement: Tuple[float, float] | None = None, force_field: bool = False, lr: float = 0.0001, lr_factor: float = 0.95, lr_patience: int = 100, weight_decay: float = 0.0, eps: float = 1e-05, guidance_weight: float = -1.0, device: str | torch.device | None = None, type_map: List[int] | None = None, prediction_type: str = 'score', sampler: str = 'em', loss_weighting: str = 'uniform', fully_connected: bool = False) → agedi.Agedi¶

Create a diffusion model for script-based training and sampling.

Parameters:

model (str, optional) – GNN backbone architecture. The name is looked up in the model registry; use register_model() to add custom backends. The built-in default is "PaiNN" (SchNetPack PaiNN).
cutoff (float, optional) – Neighbour-list cutoff radius in Å. Defaults to 6.0.
feature_size (int, optional) – Embedding / feature dimension. Defaults to 64.
n_blocks (int, optional) – Number of interaction blocks. Defaults to 4.
n_rbf (int, optional) – Number of radial basis functions. Defaults to 30.
noisers (Sequence[str or Noiser], optional) –
Noiser identifiers or instances to include. Defaults to ("CellPositions",). Recognised string identifiers (CamelCase preferred; snake_case aliases also accepted for backwards compatibility):
- "Positions" / "positions" – Positions (StandardNormal prior + Normal, for gas-phase clusters).
- "CellPositions" / "cell_positions" – CellPositions (UniformCell prior + Normal, for periodic bulk/surface systems).
- "ConfinedCellPositions" / "confined_cell_positions" – ConfinedCellPositions (UniformCellConfined prior + TruncatedNormal, for Z-confined systems).
- "Types" / "types" – Types.
sde (str or SDE, optional) – SDE for position noisers. Short aliases: "ve" (default), "vp". Pass an instantiated SDE for full control.
conditioning (str, optional) – Property to condition on, or "none" for time-only conditioning. Defaults to "none".
conditioning_type (str, optional) – Type of the conditioning module: "scalar" or "integer". Defaults to "scalar".
confinement (Tuple[float, float], optional) – Z-direction confinement bounds (z_min, z_max) in Å.
force_field (bool, optional) – When True, attach a diffusion.regressor_model. The heads shares the same representation and translator as the score model so that atomic embeddings are learned jointly. It is trained whenever the training batch contains per-atom forces and total energies (i.e. the ASE training structures have DFT (or other) energy and forces). The trained forces head enables force-field guided sampling via ForcefieldGuidanceConfig. Defaults to False.
lr (float, optional) – Learning rate. Defaults to 1e-4.
lr_factor (float, optional) – LR-scheduler reduction factor. Defaults to 0.95.
lr_patience (int, optional) – LR-scheduler patience (epochs). Defaults to 100.
weight_decay (float, optional) – Optimizer weight-decay. Defaults to 0.0.
eps (float, optional) – Minimum diffusion time. Defaults to 1e-5.
guidance_weight (float, optional) – Classifier-free guidance weight. Defaults to -1.0 (disabled).
device (str or torch.device, optional) – Target compute device. When None CUDA is used if available, otherwise CPU.
type_map (List[int], optional) – Compact type map for the Types noiser. type_map[0] must be 0 (absorbing state) and type_map[i] is the atomic number for compact index i. When provided, the Types noiser and the TypesScore head use a reduced vocabulary of size len(type_map) instead of the default 100. Auto-populated by train_from_atoms() when a "Types" noiser is requested.

Returns:

A freshly initialised Agedi model.

Return type:

Agedi

Load a trained diffusion model from an AGeDi log directory.

The model architecture is fully reconstructed from the Hydra-compatible diffusion config stored in hparams.yaml, so no additional parameters are needed.

Parameters:

path – Path to the AGeDi log / model directory (or directly to the hparams.yaml file).
checkpoint – Path to a specific checkpoint file. When None the latest checkpoint (checkpoints/last_model.ckpt) is loaded automatically.
device – Device to load the model onto. When None CUDA is used if available, otherwise CPU.

agedi.api.predict(diffusion: Agedi, structures: Sequence[ase.Atoms], *, batch_size: int = 64, cutoff: float | None = None) → List[ase.Atoms]¶

Predict energies and forces for input structures using a trained force-field.

The model must have been trained with force_field=True (i.e. it must have a regressor_model attached). The predicted energy and forces are attached to the returned Atoms objects via an SinglePointCalculator.

Parameters:

diffusion – A trained Agedi model with a force-field regressor (trained with --force_field).
structures – Input ASE Atoms objects to run predictions on.
batch_size – Number of structures per inference batch. Defaults to 64.
cutoff – Neighbour-list cutoff in Å. When None (default), the cutoff is read from the model’s representation automatically.

Returns:

The input structures with a SinglePointCalculator attached containing the predicted energy and/or forces.

Return type:

List[Atoms]

Raises:

ValueError – If the model does not have a force-field regressor.

agedi.api.sample(diffusion: Agedi, *, n_samples: int, n_atoms: int | None = None, atomic_numbers: List[int] | None = None, formula: str | None = None, positions: numpy.ndarray | None = None, cell: numpy.ndarray | None = None, pbc: numpy.ndarray | None = None, template: agedi.data.AtomsGraph | ase.Atoms | None = None, confinement: Tuple[float, float] | None = None, compile: bool = False, steps: int = 500, eps: float = 0.001, batch_size: int = 64, ff_guidance: agedi.diffusion.ForcefieldGuidanceConfig | None = None, property: Dict[str, float] | None = None, progress_bar: bool = False, save_trajectory: bool = False, print_timings: bool = False, as_atoms: bool = True, sampler=None, sampler_kwargs=None) → List[agedi.data.AtomsGraph] | List[ase.Atoms] | List[List[agedi.data.AtomsGraph]] | List[List[ase.Atoms]]¶

Sample structures from a trained diffusion model.

Parameters:

diffusion – A trained Agedi model.
n_samples – Number of structures to generate.
n_atoms – Number of atoms per structure. Automatically determined from formula if provided, or from the length of atomic_numbers when n_atoms is not explicitly given.
atomic_numbers – Atomic numbers of the generated atoms. Not required when the model has a types-noiser or when formula is provided.
formula – Chemical formula (e.g. "H2O"). Used to derive n_atoms and atomic_numbers when they are not provided explicitly.
positions – Fixed positions of the atoms (shape (n_atoms, 3)). Required when no positions-noiser is configured (type-only diffusion). Positions will not be modified during sampling.
cell – Unit-cell matrix (3×3 array or flat length-9 array). Not required when template is provided (the template’s cell is used instead).
pbc – Periodic boundary conditions as a length-3 boolean array (e.g. [True, True, False]). When template is provided its pbc is used unless this argument is given explicitly. Defaults to [True, True, True] (fully periodic) when neither template nor pbc is supplied.
template – Template structure. May be an AtomsGraph or an ASE Atoms object; the latter is automatically converted to an AtomsGraph (with confinement applied when provided). When given, cell and pbc are taken from the template unless explicitly provided.
ff_guidance – Force-field guidance configuration. When None (default) a ForcefieldGuidanceConfig with default values is used (i.e. guidance is disabled).
compile – When True, use torch.compile on the reverse diffusion step for faster sampling. Before the sampling loop starts, the maximum number of neighbors and cell-list dimensions are estimated automatically via NVIDIA nvalchemiops (estimate_max_neighbors and estimate_cell_list_sizes), and all neighbor-list buffers are pre-allocated with fixed shapes. Requires NVIDIA nvalchemiops. Defaults to False.
print_timings – When True, print a per-stage timing breakdown at the end of each sampling batch (graph init, score model, denoise, neighbor list, etc.). Defaults to False.

agedi.api.create_trainer(*, epochs: int = -1, max_time: int | Dict | datetime.timedelta | None = 24, accelerator: str = 'auto', devices: int = 1, logger: str = 'tensorboard', log_dir: str = 'logs', project: str = 'agedi', name: str = 'agedi', log_interval: int = 10, gradient_clip_val: float = 10.0, progress_bar: bool = False, print_epoch_interval: int = 10, log_grad_norm: bool = True, repeat: int | None = None, repeat_epoch: int | None = None, hparams: Dict | None = None, extra_callbacks: List[lightning.pytorch.callbacks.Callback] | None = None) → lightning.Trainer¶

Create a Lightning trainer configured for AGeDi.

Parameters:

epochs – Maximum number of training epochs (-1 = unlimited).
max_time –
Wall-clock time limit for training. Accepts:
- int – number of hours (e.g. 24 ≡ 24 hours).
- dict – Lightning-style mapping, e.g. {"days": 0, "hours": 12, "minutes": 30, "seconds": 0}.
- datetime.timedelta – a Python timedelta object.
- None – no time limit.
accelerator – Hardware accelerator to use (e.g. "auto", "gpu", "cpu"). Default: "auto".
devices – Number of devices to train on. Default: 1.
logger – Logging backend: "tensorboard" (default) or "wandb".
log_dir – Root directory for logs and checkpoints. Default: "logs".
project – WandB project name (only used when logger="wandb").
name – Experiment display name used by TensorBoard and WandB as the run sub-directory / run name. Default: "agedi".
log_interval – How often (in steps) to log metrics. Default: 10.
gradient_clip_val – Maximum gradient norm for gradient clipping. Default: 10.0.
progress_bar – Whether to show a Lightning progress bar. Default: False.
print_epoch_interval – Print a one-line training summary to stdout every this many epochs. Set to 0 to disable. Default: 10.
log_grad_norm – Whether to log the total gradient norm during training. Disable for large models where the per-step overhead is undesirable. Default: True.
repeat – Number of repetition levels for cell-repeat data augmentation. Must be set together with repeat_epoch. When None (default), no repetition augmentation is applied.
repeat_epoch – How many epochs between repetition-level increases. Required when repeat is set.
hparams – Hyperparameters dict logged to hparams.yaml via HParamsMetricLogger. When None (default), no extra hyperparameter logging is performed.
extra_callbacks – Extra Lightning callbacks to append to the default callback list. When None (default) only the built-in callbacks are used.

Returns:

A configured Trainer ready to call trainer.fit(diffusion, dataset).

Return type:

lightning.Trainer

agedi.api.train(diffusion: Agedi, dataset: agedi.data.Dataset, trainer: lightning.Trainer | None = None, ckpt_path: str | pathlib.Path | None = None, **trainer_kwargs) → lightning.Trainer¶

Train a diffusion model and return the trainer used.

Parameters:

diffusion – The diffusion model to train.
dataset – The dataset to train on.
trainer – A pre-configured Lightning Trainer. When None a new trainer is created from trainer_kwargs.
ckpt_path – Path to a Lightning checkpoint (.ckpt) to resume training from. When provided the full training state (model weights, optimiser, LR-scheduler, and epoch counter) is restored before fitting. Equivalent to passing ckpt_path to trainer.fit().
**trainer_kwargs – Additional keyword arguments forwarded to create_trainer() when trainer is None.

agedi.api.train_from_atoms(data: Sequence[ase.Atoms], *, model: str = 'PaiNN', cutoff: float | None = None, feature_size: int = 64, n_blocks: int = 4, n_rbf: int = 30, noisers: Sequence[str] = ('CellPositions',), sde: str | SDE | None = None, conditioning: str = 'none', conditioning_type: str = 'scalar', mask: str = 'none', confinement: Tuple[float, float] | None = None, force_field: bool = False, batch_size: int = 64, train_split: float | int = 0.9, val_split: float | int = 0.1, repeat: int | None = None, canonical_cell: bool = False, lr: float = 0.0001, lr_factor: float = 0.95, lr_patience: int = 100, weight_decay: float = 0.0, eps: float = 1e-05, guidance_weight: float = -1.0, data_path: str | None = None, regressor_data: Sequence[ase.Atoms] | None = None, checkpoint: str | pathlib.Path | None = None, trainer: lightning.Trainer | None = None, n_classes: int | None = None, prediction_type: str = 'score', sampler: str = 'em', loss_weighting: str = 'uniform', fully_connected: bool = False, **trainer_kwargs) → Tuple[Agedi, agedi.data.Dataset, lightning.Trainer]¶

Build (or restore), train, and return an AGeDi model from ASE Atoms data.

When a "Types" noiser is included and no checkpoint is given, the unique element types present in data are automatically detected and a compact type map is built so that the vocabulary size equals the number of distinct element types (plus the absorbing state at index 0). The n_classes parameter can be used to restrict the vocabulary to the n_classes most frequently occurring element types (sorted by atomic number).

Parameters:

data – ASE Atoms objects to train on.
model – GNN backbone architecture name. Looked up in the model registry; use register_model() to add custom backends. Default: "PaiNN" (SchNetPack PaiNN).
cutoff – Neighbour-list cutoff radius in Å. Default: 6.0.
feature_size – Embedding / feature dimension. Default: 64.
n_blocks – Number of interaction blocks in the GNN backbone. Default: 4.
n_rbf – Number of radial basis functions. Default: 30.
noisers – Sequence of noiser identifiers. Recognised string identifiers: "Positions", "CellPositions", "ConfinedCellPositions", "Types" (snake_case aliases also accepted). Default: ("CellPositions",).
sde – SDE for position noisers. Short aliases: "ve" (default), "vp". Pass an instantiated SDE for full control.
conditioning – Per-structure property to condition on (read from atoms.info[conditioning] or atoms.get_<conditioning>()), or "none" for time-only conditioning (default).
conditioning_type – Type of the conditioning module: "scalar" (default) or "integer".
mask – Atom-masking strategy: "MaskFixed" (freeze atoms tagged with ASE FixAtoms) or "none" (default).
confinement – Z-direction confinement bounds (z_min, z_max) in Å. Required when using the "ConfinedCellPositions" noiser.
force_field – When True, attach a regressor head (sharing the backbone) that predicts per-atom forces and total energy. Enables force-field guided sampling via ForcefieldGuidanceConfig. The training data must contain DFT (or other) forces and energy. Default: False.
batch_size – Mini-batch size used during training. Default: 64.
train_split – Fraction or absolute count of structures for the training split. Default: 0.9.
val_split – Fraction or absolute count of structures for the validation split. Default: 0.1.
repeat – When given, augment the dataset by repeating each structure up to repeat times along the first two cell vectors. Requires repeat_epoch (passed via **trainer_kwargs) to specify how often the repetition level increases.
canonical_cell – Store unit cells in canonical lower-triangular form. Default: False.
lr – Learning rate. Default: 1e-4.
lr_factor – LR-scheduler reduction factor. Default: 0.95.
lr_patience – LR-scheduler patience (epochs). Default: 100.
weight_decay – Optimiser weight decay. Default: 0.0.
eps – Minimum diffusion time value. Default: 1e-5.
guidance_weight – Classifier-free guidance weight. Default: -1.0 (disabled).
data_path – String path to the training data file; stored in hparams.yaml for reference only. When None, no path metadata is saved.
regressor_data – Optional additional ASE Atoms objects used exclusively for training the force-field regressor head. Structures here are never passed through the diffusion loss. Each structure must have an ASE calculator with energy and forces attached.
checkpoint – Path to a previously saved run directory (containing hparams.yaml) or directly to a .ckpt checkpoint file. When provided the model architecture and weights are loaded from the checkpoint instead of being built from the architecture parameters (model, cutoff, feature_size, etc.). The full training state (optimiser, LR-scheduler, epoch counter) is also restored so that training continues seamlessly. Supply data to train on new data, or use the original data path to resume on the same dataset.
trainer – A pre-configured Lightning Trainer. When None (default) a new trainer is built from **trainer_kwargs.
n_classes – Number of element-type classes to use for the Types noiser (not counting the absorbing state at index 0). When None (default), all distinct element types present in data are used. Must not exceed the number of distinct types in the training data. Ignored when checkpoint is provided (the vocabulary is loaded from the checkpoint).
**trainer_kwargs – Additional keyword arguments forwarded to create_trainer() when trainer is None. Common keys: epochs, max_time, logger, log_dir, gradient_clip_val, repeat_epoch.

Returns:

The trained diffusion model, the dataset, and the Lightning trainer.

Return type:

Tuple[Agedi, Dataset, Trainer]

agedi.api.train_from_config(config: str | pathlib.Path | Dict) → Tuple[Agedi, agedi.data.Dataset, lightning.Trainer]¶

Train an AGeDi model from a YAML configuration file or dictionary.

This is the Hydra-style entry point. The configuration can be provided as:

a path to a YAML file (str or Path),
a plain Python dict,
a Hydra / OmegaConf DictConfig.

The function loads the training data from config["data_path"] (an ASE-readable file) and delegates to train_from_atoms() with the remaining configuration values.

The minimal required key is data_path. All other keys are optional and fall back to the same defaults as train_from_atoms().

A ready-to-edit template is shipped with the package at agedi/conf/train.yaml.

Parameters:: config – Configuration source – a YAML file path, a dict, or an OmegaConf DictConfig.
Returns:: The trained diffusion model, the dataset used, and the Lightning trainer.
Return type:: Tuple[Agedi, Dataset, Trainer]

Examples

Minimal Python usage:

from agedi import train_from_config
diffusion, dataset, trainer = train_from_config("conf/train.yaml")

Programmatic override:

from agedi import train_from_config
cfg = {"data_path": "train.traj", "epochs": 50, "feature_size": 128}
diffusion, _, _ = train_from_config(cfg)