3. Screening¶

In order to compute the deliverable capacity of a material, you need to compute the methane loading both at the loading and at the discharge pressure, and then do some simple math - in other words, a simple workflow. AiiDA provides WorkChains to orchestrate the running of calculations.

We’ve prepared a WorkChain to compute the deliverable methane capacity.

Download it from here and place the file in some directory, then add the path to this directory to PYTHONPATH and restart the daemon:

$ export PYTHONPATH=/path/to/your/directory/:$PYTHONPATH
$ verdi daemon restart

Now, we analyze step by step the WorkChain, and we will see later how to run it.

3.1. Step 0: Defining inputs, outputs and execution steps¶

AiiDA WorkChains are python classes. The define method specifies the inputs, the steps of execution and the outputs of the WorkChain:

class DcMethane(WorkChain):
    """Compute methane deliverable capacity for a given structure."""

    @classmethod
    def define(cls, spec):
        """Define workflow specification.

        This is the most important method of a Workchain, which defines the
        inputs it takes, the logic of the execution and the outputs
        that are generated in the process.
        """
        super(DcMethane, cls).define(spec)

        # First we define the inputs, specifying the type we expect

        spec.input("structure", valid_type=CifData, required=True)
        spec.input(
            "zeopp_parameters", valid_type=NetworkParameters, required=True)
        spec.input("raspa_parameters", valid_type=ParameterData, required=True)
        spec.input("atomic_radii", valid_type=SingleFile, required=True)
        spec.input("zeopp_code", valid_type=Code, required=True)
        spec.input("raspa_code", valid_type=Code, required=True)

        # The outline describes the basic logic that defines
        # which steps are executed in what order and based on
        # what conditions. Each `cls.method` is implemented below
        spec.outline(
            cls.init,
            cls.run_block_zeopp,
            cls.run_loading_raspa_low_p,
            cls.parse_loading_raspa,
            cls.run_loading_raspa_high_p,
            cls.parse_loading_raspa,
            cls.extract_results,
        )

        # Here we define the output the Workchain will generate and
        # return. Dynamic output allows a variety of AiiDA data nodes
        # to be returned
        spec.dynamic_output()

The DcMethane uses spec.input() to specify the following inputs: 1. The structure in CifData format 1. Raspa parameters of type ParameterData 1. Zeo++ parameters of type NetworkParameters 1. A file containing specification of atomic radii (of type SingleFile) 1. Zeo++ and Raspa codes (of type Code).

It then uses spec.outline() to specify the steps of the workchain (using functions defined below). All 7 steps will be executed in order.

3.2. Step 1: Prepare input parameters and variables¶

 def init(self):
     """Initialize variables."""

     self.ctx.loading_average = {}
     self.ctx.loading_dev = {}

     self.ctx.options = {
         "resources": {
             "num_machines": 1,
             "tot_num_mpiprocs": 1,
             "num_mpiprocs_per_machine": 1,
         },
         "max_wallclock_seconds": 4 * 60 * 60,
         "max_memory_kb": 2000000,
            "queue_name": "molsim",
         "withmpi": False,
     }

| **Note**
| The **context** (``self.ctx``) variable is a container that is
accessible by every function in the ``DcMethane`` workchain. In this
particular case we are creating two empty dictionaries to store the
loading average and its deviation at different pressures.

3.3. Step 2: Compute the geometric parameters of the MOFs.¶

As described in Setting for Raspa:

BlockPockets and BlockPocketsFileName will be filled by AiiDA: if Zeo++ finds some non accessible pore volume, it can generate a .block file with the positions and the radii of blocking spheres. These spheres are inserted in the framework to prevent Raspa from inserting molecules in the non accessible pore.

Here we will compute blocked pockets of a particular material employing the Zeo++ code.

def run_block_zeopp(self):
    """This function will perform a zeo++ calculation to obtain the blocked pockets."""

    # Create the input dictionary
    inputs = {
        'code': self.inputs.zeopp_code,
        'structure': self.inputs.structure,
        'parameters': self.inputs.zeopp_parameters,
        'atomic_radii': self.inputs.atomic_radii,
        '_options': self.ctx.options,
    }

    # Create the calculation process and launch it
    process = ZeoppCalculation.process()
    future = submit(process, **inputs)
    self.report("pk: {} | Running Zeo++ to obtain blocked pockets".format(
        future.pid))

    return ToContext(zeopp=Outputs(future))

As you can see: in order to run the calculation one just needs to provide code, structure, parameters and atomic_radii file that are all directly taken from the workflow inputs. The job submission happens in exactly the same way as it was for the single raspa calculation that we tried previously.

Note

The self.report() functions provides a convenient way to report the status of a workflow that can be access from the verdi command line via verdi work report <PK>

3.4. Step 3, 5: Compute the methane loading¶

Steps 3 (run_loading_raspa_low_p) and 5 (run_loading_raspa_high_p) compute the methane loading at 5.8 and 65 bars respectively in [molecules/cell] units. The functions are defined as follows:

def run_loading_raspa_low_p(self):
    self.ctx.current_pressure = 5.8e5
    self.ctx.current_pressure_label = "low"
    return self._run_loading_raspa()

def run_loading_raspa_high_p(self):
    self.ctx.current_pressure = 65e5
    self.ctx.current_pressure_label = "high"
    return self._run_loading_raspa()

and finally execute the same _run_loading_raspa function.

def _run_loading_raspa(self):
    """Perform raspa calculation at given pressure.

    Most of the runtime will be spent in this function.
    """
    # Create the input dictionary
    inputs = {
        'code': self.inputs.raspa_code,
        'structure': self.inputs.structure,
        'parameters': update_raspa_parameters(self.inputs.raspa_parameters, Float(self.ctx.current_pressure)),
        'block_component_0': self.ctx.zeopp['block'],
        '_options': self.ctx.options,
    }

    # Create the calculation process and launch it
    process = RaspaCalculation.process()
    future = submit(process, **inputs)
    self.report("pk: {} | Running raspa for the pressure {} [bar]".format(future.pid, self.ctx.current_pressure / 1e5))

    return ToContext(raspa=Outputs(future))

ToContext() will create a variable self.ctx.raspa that will contain the results of the calculation.

Outputs() function will wait for the calculation to be completed.

3.5. Step 4, 6: Extract pressure and methane loading¶

Steps 4 and 6 extract pressure and methane loading (with deviation) and puts them into loading_average and loading_dev dictionaries stored in the context.

"""Extract pressure and loading average of last completed raspa calculation."""
loading_average = self.ctx.raspa["component_0"].dict.loading_absolute_average
loading_dev = self.ctx.raspa["component_0"].dict.loading_absolute_dev
self.ctx.loading_average[self.ctx.current_pressure_label] = loading_average
self.ctx.loading_dev[self.ctx.current_pressure_label] = loading_dev

3.6. Step 7: Store the selected computed parameters as the output node¶

This final step is to prepare the results of the DcMethane workchain extracting the most relevant information and putting it in a ParameterData object.

In particular, we extract the deliverable capacities at low and high pressures that are computed in previous steps. We transform data from [molecule/unit cell] units to [cm:sup:3_STP/cm3] using the conversion factor provided by raspa (conversion_factor_molec_uc_to_cm3stp_cm3). We also [compute] (https://en.wikipedia.org/wiki/Sum_of_normally_distributed_random_variables) the standard deviation of the difference.

def extract_results(self):
    """Extract results of the workflow.

    Attaches the results of the raspa calculation and the initial structure to the outputs.
    """
    from math import sqrt
    cf = self.ctx.raspa["component_0"].dict.conversion_factor_molec_uc_to_cm3stp_cm3
    dc = self.ctx.loading_average["high"] - self.ctx.loading_average["low"]
    dc_dev = sqrt(self.ctx.loading_dev["high"]**2 + self.ctx.loading_dev["low"]**2)

    result = {
        "deliverable_capacity": dc * cf,
        "deliverable_capacity_units": "cm^3_STP/cm^3",
        "deliverable_capacity_dev":  dc_dev * cf,
        "loading_absolute_average_low_p" : self.ctx.loading_average["low"] * cf,
        "loading_absolute_dev_low_p" : self.ctx.loading_dev["low"] * cf,
        "loading_units" : "cm^3_STP/cm^3",
        "loading_absolute_average_high_p" : self.ctx.loading_average["high"] * cf,
        "loading_absolute_dev_high_p" : self.ctx.loading_dev["high"] * cf,
    }
    self.out("result", ParameterData(dict=result))

    self.report("Workchain <{}> completed successfully".format(
        self.calc.pk))
    return

3.7. Exercises¶

Before you actually start doing the calculations please setup the zeo++ code as shown here:

* PK:             60109
* UUID:           8a37224a-1247-4484-b3c8-ce3e8f37cee7
* Label:          zeopp
* Description:    zeo++ code for the molsim course
* Default plugin: zeopp.network
* Used by:        1 calculations
* Type:           remote
* Remote machine: bazis
* Remote absolute path:
  /home/molsim20/network
* prepend text:
  # No prepend text.
* append text:
  # No append text.

Should you have any doubts, just consult the Computer setup and configuration part of our tutorial.

The following script is necessary to run the DcMethane workchain. You need to save it as run_DcMethane.py, edit it with your settings and run it with verdi run run_DcMethane.py.

from aiida.backends.utils import load_dbenv, is_dbenv_loaded
if not is_dbenv_loaded():
        load_dbenv()

import os
import sys
import time
from deliverable_capacity import DcMethane

from aiida.orm import DataFactory
from aiida.orm.data.cif import CifData
from aiida.orm.data.base import Float
from aiida.work.run import run, submit

NetworkParameters = DataFactory('zeopp.parameters')
ParameterData = DataFactory('parameter')

def multiply_unit_cell(cif, cutoff):
    """Returns the multiplication factors (tuple of 3 int) for the cell vectors
    that are needed to respect: min(perpendicular_width) > threshold
    """
    from math import cos, sin, sqrt, pi
    import numpy as np
    deg2rad=pi/180.
    struct=cif.values.dictionary.itervalues().next()
    a = float(struct['_cell_length_a'])
    b = float(struct['_cell_length_b'])
    c = float(struct['_cell_length_c'])
    alpha = float(struct['_cell_angle_alpha'])*deg2rad
    beta  = float(struct['_cell_angle_beta'])*deg2rad
    gamma = float(struct['_cell_angle_gamma'])*deg2rad
    v = sqrt(1-cos(alpha)**2-cos(beta)**2-cos(gamma)**2+2*cos(alpha)*cos(beta)*cos(gamma))
    cell=np.zeros((3,3))
    cell[0,:] = [a, 0, 0]
    cell[1,:] = [b*cos(gamma), b*sin(gamma),0]
    cell[2,:] = [c*cos(beta), c*(cos(alpha)-cos(beta)*cos(gamma))/(sin(gamma)),c*v/sin(gamma)]
    cell=np.array(cell)
    diag = np.diag(cell)
    return tuple(int(i) for i in np.ceil(cutoff/diag*2.))

cutoff = 8.8         #TO EDIT
probe_radius = 1.865 #Why this value?

zeopp_params = NetworkParameters(dict={
    'ha': True,
    'block': [probe_radius, 100],
})

# Search for the structures to evaluate and submit them
q = QueryBuilder()
q.append(CifData, filters={'label': { 'in': [ ...] }}) #TO EDIT: provide labels of the structures you want to submit

for item in q.all():
    cif = item[0]
    print (cif)
    nx, ny, nz = multiply_unit_cell(cif, cutoff)
    unitscells="{} {} {}".format(nx,ny,nz)

    raspa_params = ParameterData(dict={
        "GeneralSettings":
        {
        "SimulationType"                   : "MonteCarlo",
        "NumberOfCycles"                   : 888,  #TO EDIT
        "NumberOfInitializationCycles"     : 888,  #TO EDIT
        "PrintEvery"                       : 100,

        "CutOff"                           : cutoff,

        "Forcefield"                       : "UFF-TraPPE",
        "ChargeMethod"                     : "None",
        "UnitCells"                        : "<int> <int> <int>",
        "ExternalTemperature"              : 298,

        },
        "Component":
        [{
        "MoleculeName"                     : "methane",
        "MoleculeDefinition"               : "TraPPE",
        "MolFraction"                      : 1.0,
        "TranslationProbability"           : 8.8, #TO EDIT
        "RotationProbability"              : 8.8, #TO EDIT
        "ReinsertionProbability"           : 8.8, #TO EDIT
        "SwapProbability"                  : 8.8, #TO EDIT
        "CreateNumberOfMolecules"          : 8, #TO EDIT
        }],
    })


    outputs = submit(
            DcMethane,
            structure=cif,
            zeopp_parameters = zeopp_params,
            raspa_parameters = raspa_params,
            atomic_radii = load_node('27d2af72-3af0-48a6-a563-24d1d6d6eb60'),
            zeopp_code=Code.get_from_string('zeopp@bazis1'),
            raspa_code=Code.get_from_string('raspa@bazis1'),
        )
    time.sleep(40)

Note

The function multiply_unit_cell is automatically computing the number of UnitCells needed, nx ny nz. This function contains the math to deal also with non-orthogonal unit cells.

Consult the Querying the AiiDA database part of the tutorial in order to find out which filter you should put in q.append(CifData, filters={}) to select the appropriate structures.