Merge branch 'master' into package-reorganization-step1

# Conflicts: # doc/src/Packages_details.rst
2021-07-02 13:24:49 -04:00
parent 411d1f1240 bf5934e581
commit 34fa0da60e
22 changed files with 1003 additions and 35 deletions
--- a/doc/src/Commands_compute.rst
+++ b/doc/src/Commands_compute.rst
@ -60,6 +60,7 @@ KOKKOS, o = OPENMP, t = OPT.
   * :doc:`erotate/sphere <compute_erotate_sphere>`
   * :doc:`erotate/sphere/atom <compute_erotate_sphere_atom>`
   * :doc:`event/displace <compute_event_displace>`
+   * :doc:`fabric <compute_fabric>`
   * :doc:`fep <compute_fep>`
   * :doc:`force/tally <compute_tally>`
   * :doc:`fragment/atom <compute_cluster_atom>`
--- a/doc/src/Howto_granular.rst
+++ b/doc/src/Howto_granular.rst
@ -17,6 +17,11 @@ This compute
 * :doc:`compute erotate/sphere <compute_erotate_sphere>`

 calculates rotational kinetic energy which can be :doc:`output with thermodynamic info <Howto_output>`.
+The compute
+
+* :doc:`compute fabric <compute_fabric>`
+
+calculates various versions of the fabric tensor for granular and non-granular pair styles.

 Use one of these 4 pair potentials, which compute forces and torques
 between interacting pairs of particles:
--- a/doc/src/Library_create.rst
+++ b/doc/src/Library_create.rst
@ -9,6 +9,7 @@ This section documents the following functions:
 - :cpp:func:`lammps_close`
 - :cpp:func:`lammps_mpi_init`
 - :cpp:func:`lammps_mpi_finalize`
+- :cpp:func:`lammps_kokkos_finalize`

 --------------------

--- a/doc/src/Python_create.rst
+++ b/doc/src/Python_create.rst
@ -134,7 +134,10 @@ compiled with.
 The :py:func:`lmp.close() <lammps.lammps.close()>` call is
 optional since the LAMMPS class instance will also be deleted
 automatically during the :py:class:`lammps <lammps.lammps>` class
-destructor.
+destructor.  Instead of :py:func:`lmp.close() <lammps.lammps.close()>`
+it is also possible to call :py:func:`lmp.finalize() <lammps.lammps.finalize()>`;
+this will destruct the LAMMPS instance, but also finalized and release
+the MPI and/or Kokkos environment if enabled and active.

 Note that you can create multiple LAMMPS objects in your Python
 script, and coordinate and run multiple simulations, e.g.
--- a/doc/src/compute.rst
+++ b/doc/src/compute.rst
@ -206,6 +206,7 @@ The individual style names on the :doc:`Commands compute <Commands_compute>` doc
 * :doc:`erotate/sphere <compute_erotate_sphere>` - rotational energy of spherical particles
 * :doc:`erotate/sphere/atom <compute_erotate_sphere_atom>` - rotational energy for each spherical particle
 * :doc:`event/displace <compute_event_displace>` - detect event on atom displacement
+* :doc:`fabric <compute_fabric>` - calculates fabric tensors from pair interactions
 * :doc:`fep <compute_fep>` -
 * :doc:`force/tally <compute_tally>` -
 * :doc:`fragment/atom <compute_cluster_atom>` - fragment ID for each atom
--- a/doc/src/compute_fabric.rst
+++ b/doc/src/compute_fabric.rst
@ -0,0 +1,185 @@
+.. index:: compute fabric
+
+compute fabric command
+======================
+
+Syntax
+""""""
+
+.. parsed-literal::
+
+   compute ID group-ID fabric cutoff attribute1 attribute2 ... keyword values ...
+
+* ID, group-ID are documented in :doc:`compute <compute>` command
+* fabric = style name of this compute command
+* cutoff = *type* or *radius*
+
+  .. parsed-literal::
+
+       *type* = cutoffs determined based on atom types
+       *radius* = cutoffs determined based on atom diameters (atom style sphere)
+
+* one or more attributes may be appended
+
+  .. parsed-literal::
+
+       *contact* = contact tensor
+       *branch* = branch tensor
+       *force/normal* = normal force tensor
+       *force/tangential* = tangential force tensor
+
+* zero or more keyword/value pairs may be appended
+* keyword = *type/include*
+
+  .. parsed-literal::
+
+       *type/include* value = arg1 arg2
+         arg = separate lists of types (see below)
+
+Examples
+""""""""
+
+.. code-block:: LAMMPS
+
+   compute 1 all fabric type contact force/normal type/include 1,2 3*4
+   compute 1 all fabric radius force/normal force/tangential
+
+Description
+"""""""""""
+
+Define a compute that calculates various fabric tensors for pairwise
+interaction :ref:`(Ouadfel) <Ouadfel>`. Fabric tensors are commonly used
+to quantify the anisotropy or orientation of granular contacts but can also
+be used to characterize the direction of pairwise interactions in general
+systems. The *type* and *radius* settings are used to select whether interactions
+cutoffs are determined by atom types or by the sum of atomic radii (atom
+style sphere), respectively. Calling this compute is roughly the cost of a
+pair style invocation as it involves a loop over the neighbor list. If the
+normal or tangential force tensors are requested, it will be more expensive
+than a pair style invocation as it will also recalculate all pair forces.
+
+Four fabric tensors are available: the contact, branch, normal force, or
+tangential force tensor. The contact tensor is calculated as
+
+.. math::
+
+   C_{ab}  =  \frac{15}{2} (\phi_{ab} - \mathrm{tr}(\phi) \delta_{ab})
+
+where :math:`a` and :math:`b` are the :math:`x`, :math:`y`, :math:`z`
+directions, :math:`\delta_{ab}` is the Kronecker delta function, and
+the tensor :math:`\phi` is defined as
+
+.. math::
+
+   \phi_{ab}  =  \sum_{n = 1}^{N_p} \frac{r_{a} r_{b}}{r^2}
+
+where :math:`n` loops over the :math:`N_p` pair interactions in the simulation,
+:math:`r_{a}` is the :math:`a` component of the radial vector between the
+two pairwise interacting particles, and :math:`r` is the magnitude of the radial vector.
+
+The branch tensor is calculated as
+
+.. math::
+
+   B_{ab}  =  \frac{15}{6 \mathrm{tr}(D)} (D_{ab} - \mathrm{tr}(D) \delta_{ab})
+
+where the tensor :math:`D` is defined as
+
+.. math::
+
+   D_{ab}  =  \sum_{n = 1}^{N_p}
+                \frac{1}{N_c (r^2 + C_{cd} r_c r_d)}
+                \frac{r_{a} r_{b}}{r}
+
+where :math:`N_c` is the total number of contacts in the system and the subscripts
+:math:`c` and :math:`d` indices are summed according to Einstein notation.
+
+The normal force fabric tensor is calculated as
+
+.. math::
+
+   F^n_{ab}  =  \frac{15}{6 \mathrm{tr}(N)} (N_{ab} - \mathrm{tr}(N) \delta_{ab})
+
+where the tensor :math:`N` is defined as
+
+.. math::
+
+   N_{ab}  =  \sum_{n = 1}^{N_p}
+                \frac{1}{N_c (r^2 + C_{cd} r_c r_d)}
+                \frac{r_{a} r_{b}}{r^2} f_n
+
+and :math:`f_n` is the magnitude of the normal, central-body force between the two atoms.
+
+Finally, the tangential force fabric tensor is only defined for pair styles that
+apply tangential forces to particles, namely granular pair styles. It is calculated
+as
+
+.. math::
+
+   F^t_{ab}  =  \frac{15}{9 \mathrm{tr}(N)} (T_{ab} - \mathrm{tr}(T) \delta_{ab})
+
+where the tensor :math:`T` is defined as
+
+.. math::
+
+   T_{ab}  =  \sum_{n = 1}^{N_p}
+                \frac{1}{N_c (r^2 + C_{cd} r_c r_d)}
+                \frac{r_{a} r_{b}}{r^2} f_t
+
+and :math:`f_t` is the magnitude of the tangential force between the two atoms.
+
+The *type/include* keyword filters interactions based on the types of the two atoms.
+Interactions between two atoms are only included in calculations if the atom types
+are in the two lists. Each list consists of a series of type
+ranges separated by commas. The range can be specified as a
+single numeric value, or a wildcard asterisk can be used to specify a range
+of values.  This takes the form "\*" or "\*n" or "n\*" or "m\*n".  For
+example, if M = the number of atom types, then an asterisk with no numeric
+values means all types from 1 to M.  A leading asterisk means all types
+from 1 to n (inclusive).  A trailing asterisk means all types from n to M
+(inclusive).  A middle asterisk means all types from m to n (inclusive).
+Multiple *type/include* keywords may be added.
+
+Output info
+"""""""""""
+
+This compute calculates a local vector of doubles and a scalar. The vector stores the
+unique components of the first requested tensor in the order xx, yy, zz, xy, xz, yz
+followed by the same components for all subsequent tensors. The length of the vector
+is therefore six times the number of requested tensors. The scalar output is the
+number of pairwise interactions included in the calculation of the fabric tensor.
+
+Restrictions
+""""""""""""
+
+This fix is part of the GRANULAR package.  It is only enabled if LAMMPS
+was built with that package.  See the :doc:`Build package <Build_package>`
+doc page for more info.
+
+Currently, compute *fabric* does not support pair styles
+with many-body interactions.  It also does not
+support models with long-range Coulombic or dispersion forces,
+i.e. the kspace_style command in LAMMPS.  It also does not support the
+following fixes which add rigid-body constraints: :doc:`fix shake
+<fix_shake>`, :doc:`fix rattle <fix_shake>`, :doc:`fix rigid
+<fix_rigid>`, :doc:`fix rigid/small <fix_rigid>`. It does not support
+granular pair styles that extend beyond the contact of atomic radii
+(e.g. JKR and DMT).
+
+Related commands
+""""""""""""""""
+
+none
+
+Default
+"""""""
+
+none
+
+----------
+
+.. _Ouadfel:
+
+**(Ouadfel)** Ouadfel and Rothenburg
+"Stress-force-fabric relationship for assemblies of ellipsoids",
+Mechanics of Materials (2001). (`link to paper <https://doi.org/10.1016/S0167-6636(00)00057-0>`_)
--- a/doc/utils/sphinx-config/false_positives.txt
+++ b/doc/utils/sphinx-config/false_positives.txt
@ -2680,6 +2680,7 @@ qoverride
 qqr
 qqrd
 qtb
+Quadfel
 quadratically
 quadrupolar
 Quant
@ -2861,6 +2862,7 @@ Rossky
 rosybrown
 rotationally
 Rotenberg
+Rothenburg
 Rovigatti
 royalblue
 rozero
--- a/fortran/lammps.f90
+++ b/fortran/lammps.f90
@ -76,17 +76,15 @@ MODULE LIBLAMMPS
        TYPE(c_ptr), VALUE :: handle
      END SUBROUTINE lammps_close

-      SUBROUTINE lammps_mpi_init(handle) BIND(C, name='lammps_mpi_init')
-        IMPORT :: c_ptr
-        TYPE(c_ptr), VALUE :: handle
+      SUBROUTINE lammps_mpi_init() BIND(C, name='lammps_mpi_init')
      END SUBROUTINE lammps_mpi_init

-      SUBROUTINE lammps_mpi_finalize(handle) &
-          BIND(C, name='lammps_mpi_finalize')
-        IMPORT :: c_ptr
-        TYPE(c_ptr), VALUE :: handle
+      SUBROUTINE lammps_mpi_finalize() BIND(C, name='lammps_mpi_finalize')
      END SUBROUTINE lammps_mpi_finalize

+      SUBROUTINE lammps_kokkos_finalize() BIND(C, name='lammps_kokkos_finalize')
+      END SUBROUTINE lammps_kokkos_finalize
+
      SUBROUTINE lammps_file(handle,filename) BIND(C, name='lammps_file')
        IMPORT :: c_ptr
        TYPE(c_ptr), VALUE :: handle
@ -188,7 +186,8 @@ CONTAINS

    IF (PRESENT(finalize)) THEN
        IF (finalize) THEN
-            CALL lammps_mpi_finalize(self%handle)
+            CALL lammps_kokkos_finalize()
+            CALL lammps_mpi_finalize()
        END IF
    END IF
  END SUBROUTINE lmp_close
--- a/python/lammps/core.py
+++ b/python/lammps/core.py
@ -460,10 +460,16 @@ class lammps(object):
  # -------------------------------------------------------------------------

  def finalize(self):
-    """Shut down the MPI communication through the library interface by calling :cpp:func:`lammps_finalize`.
+    """Shut down the MPI communication and Kokkos environment (if active) through the
+       library interface by  calling :cpp:func:`lammps_mpi_finalize` and
+       :cpp:func:`lammps_kokkos_finalize`.
+
+       You cannot create or use any LAMMPS instances after this function is called
+       unless LAMMPS was compiled without MPI and without Kokkos support.
    """
    self.close()
-    self.lib.lammps_finalize()
+    self.lib.lammps_kokkos_finalize()
+    self.lib.lammps_mpi_finalize()

  # -------------------------------------------------------------------------

--- a/src/.gitignore
+++ b/src/.gitignore
@ -386,6 +386,8 @@
 /compute_erotate_rigid.h
 /compute_event_displace.cpp
 /compute_event_displace.h
+/compute_fabric.cpp
+/compute_fabric.h
 /compute_fep.cpp
 /compute_fep.h
 /compute_force_tally.cpp
--- a/src/GRANULAR/compute_fabric.cpp
+++ b/src/GRANULAR/compute_fabric.cpp
@ -0,0 +1,595 @@
+/* ----------------------------------------------------------------------
+   LAMMPS - Large-scale Atomic/Molecular Massively Parallel Simulator
+   https://www.lammps.org/, Sandia National Laboratories
+   Steve Plimpton, sjplimp@sandia.gov
+
+   Copyright (2003) Sandia Corporation.  Under the terms of Contract
+   DE-AC04-94AL85000 with Sandia Corporation, the U.S. Government retains
+   certain rights in this software.  This software is distributed under
+   the GNU General Public License.
+
+   See the README file in the top-level LAMMPS directory.
+------------------------------------------------------------------------- */
+
+#include "compute_fabric.h"
+
+#include "atom.h"
+#include "error.h"
+#include "force.h"
+#include "memory.h"
+#include "modify.h"
+#include "neigh_list.h"
+#include "neigh_request.h"
+#include "neighbor.h"
+#include "pair.h"
+#include "tokenizer.h"
+#include "update.h"
+
+#include <cmath>
+#include <cstring>
+
+using namespace LAMMPS_NS;
+
+enum { OTHER, GRANULAR };
+enum { TYPE, RADIUS };
+enum { CN, BR, FN, FT };
+
+/* ---------------------------------------------------------------------- */
+
+ComputeFabric::ComputeFabric(LAMMPS *lmp, int narg, char **arg) :
+    Compute(lmp, narg, arg), tensor_style(NULL)
+{
+  if (narg < 3) error->all(FLERR, "Illegal compute fabric command");
+
+  if (strcmp(arg[3], "type") == 0)
+    cutstyle = TYPE;
+  else if (strcmp(arg[3], "radius") == 0)
+    cutstyle = RADIUS;
+  else
+    error->all(FLERR, "Illegal compute fabric command");
+
+  if (cutstyle == RADIUS && !atom->radius_flag)
+    error->all(FLERR, "Compute fabric radius style requires atom attribute radius");
+
+  // If optional arguments included, this will be oversized
+  ntensors = narg - 4;
+  tensor_style = new int[ntensors];
+
+  cn_flag = 0;
+  br_flag = 0;
+  fn_flag = 0;
+  ft_flag = 0;
+  type_filter = nullptr;
+
+  ntensors = 0;
+  int iarg = 4;
+  while (iarg < narg) {
+    if (strcmp(arg[iarg], "contact") == 0) {
+      cn_flag = 1;
+      tensor_style[ntensors++] = CN;
+    } else if (strcmp(arg[iarg], "branch") == 0) {
+      br_flag = 1;
+      tensor_style[ntensors++] = BR;
+    } else if (strcmp(arg[iarg], "force/normal") == 0) {
+      fn_flag = 1;
+      tensor_style[ntensors++] = FN;
+    } else if (strcmp(arg[iarg], "force/tangential") == 0) {
+      ft_flag = 1;
+      tensor_style[ntensors++] = FT;
+    } else if (strcmp(arg[iarg], "type/include") == 0) {
+      if (iarg + 1 >= narg) error->all(FLERR, "Invalid keyword in compute fabric command");
+      int ntypes = atom->ntypes;
+
+      int i, j, itype, jtype, in, jn, infield, jnfield;
+      int inlo, inhi, jnlo, jnhi;
+      char *istr, *jstr;
+      if (!type_filter) {
+        memory->create(type_filter, ntypes + 1, ntypes + 1, "compute/fabric:type_filter");
+
+        for (i = 0; i <= ntypes; i++) {
+          for (j = 0; j <= ntypes; j++) { type_filter[i][j] = 0; }
+        }
+      }
+
+      in = strlen(arg[iarg + 1]) + 1;
+      istr = new char[in];
+      strcpy(istr, arg[iarg + 1]);
+      std::vector<std::string> iwords = Tokenizer(istr, ",").as_vector();
+      infield = iwords.size();
+
+      jn = strlen(arg[iarg + 2]) + 1;
+      jstr = new char[jn];
+      strcpy(jstr, arg[iarg + 2]);
+      std::vector<std::string> jwords = Tokenizer(jstr, ",").as_vector();
+      jnfield = jwords.size();
+
+      for (i = 0; i < infield; i++) {
+        const char *ifield = iwords[i].c_str();
+        utils::bounds(FLERR, ifield, 1, ntypes, inlo, inhi, error);
+
+        for (j = 0; j < jnfield; j++) {
+          const char *jfield = jwords[j].c_str();
+          utils::bounds(FLERR, jfield, 1, ntypes, jnlo, jnhi, error);
+
+          for (itype = inlo; itype <= inhi; itype++) {
+            for (jtype = jnlo; jtype <= jnhi; jtype++) {
+              type_filter[itype][jtype] = 1;
+              type_filter[jtype][itype] = 1;
+            }
+          }
+        }
+      }
+
+      delete[] istr;
+      delete[] jstr;
+
+      iarg += 2;
+    } else
+      error->all(FLERR, "Illegal compute fabric command");
+    iarg++;
+  }
+
+  vector_flag = 1;
+  size_vector = ntensors * 6;
+  extvector = 0;
+
+  scalar_flag = 1;
+  extscalar = 1;
+
+  vector = new double[size_vector];
+}
+
+/* ---------------------------------------------------------------------- */
+
+ComputeFabric::~ComputeFabric()
+{
+  delete[] vector;
+  delete[] tensor_style;
+  memory->destroy(type_filter);
+}
+
+/* ---------------------------------------------------------------------- */
+
+void ComputeFabric::init()
+{
+  if (force->pair == NULL) error->all(FLERR, "No pair style is defined for compute fabric");
+  if (force->pair->single_enable == 0 && (fn_flag || ft_flag))
+    error->all(FLERR, "Pair style does not support compute fabric normal or tangential force");
+
+  // Find if granular or gran
+  pstyle = OTHER;
+  if (force->pair_match("^granular", 0) || force->pair_match("^gran/", 0)) pstyle = GRANULAR;
+
+  if (pstyle != GRANULAR && ft_flag)
+    error->all(FLERR, "Pair style does not calculate tangential forces for compute fabric");
+
+  if (force->pair->beyond_contact)
+    error->all(FLERR, "Compute fabric does not support pair styles that extend beyond contact");
+
+  // need an occasional half neighbor list
+  // set size to same value as request made by force->pair
+  // this should enable it to always be a copy list (e.g. for granular pstyle)
+
+  int irequest = neighbor->request(this, instance_me);
+  neighbor->requests[irequest]->pair = 0;
+  neighbor->requests[irequest]->compute = 1;
+  neighbor->requests[irequest]->occasional = 1;
+  NeighRequest *pairrequest = neighbor->find_request((void *) force->pair);
+  if (pairrequest) neighbor->requests[irequest]->size = pairrequest->size;
+}
+
+/* ---------------------------------------------------------------------- */
+
+void ComputeFabric::init_list(int /*id*/, NeighList *ptr)
+{
+  list = ptr;
+}
+
+/* ---------------------------------------------------------------------- */
+
+void ComputeFabric::compute_vector()
+{
+  invoked_vector = update->ntimestep;
+
+  int i, j, ii, jj, inum, jnum, itype, jtype;
+  tagint itag, jtag;
+  double xtmp, ytmp, ztmp, delx, dely, delz;
+  double r, rinv, rsq, radsum, eng, fpair;
+
+  double nx, ny, nz;
+  double ncinv, denom, fn, ft, prefactor;
+  double br_tensor[6], ft_tensor[6], fn_tensor[6];
+  double trace_phi, trace_D, trace_Xfn, trace_Xft;
+  double phi_ij[6] = {0.0};
+  double Ac_ij[6] = {0.0};
+  double D_ij[6] = {0.0};
+  double Xfn_ij[6] = {0.0};
+  double Xft_ij[6] = {0.0};
+  double temp_dbl[6];
+
+  int *ilist, *jlist, *numneigh, **firstneigh;
+
+  double **x = atom->x;
+  double *radius = atom->radius;
+  tagint *tag = atom->tag;
+  int *type = atom->type;
+  int *mask = atom->mask;
+  int nlocal = atom->nlocal;
+  int newton_pair = force->newton_pair;
+
+  // invoke half neighbor list (will copy or build if necessary)
+  neighbor->build_one(list);
+
+  inum = list->inum;
+  ilist = list->ilist;
+  numneigh = list->numneigh;
+  firstneigh = list->firstneigh;
+
+  Pair *pair = force->pair;
+  double **cutsq = force->pair->cutsq;
+
+  // invoke compute_scalar() to update the number of contacts, if needed
+  nc = compute_scalar();
+
+  // If no contacts, everything will be zero
+  if (nc == 0) {
+    for (i = 0; i < size_vector; i++) vector[i] = 0.0;
+    return;
+  }
+  ncinv = 1.0 / nc;
+
+  // First loop through and calculate contact fabric tensor
+  for (ii = 0; ii < inum; ii++) {
+    i = ilist[ii];
+    if (!(mask[i] & groupbit)) continue;
+
+    xtmp = x[i][0];
+    ytmp = x[i][1];
+    ztmp = x[i][2];
+    itag = tag[i];
+    itype = type[i];
+    jlist = firstneigh[i];
+    jnum = numneigh[i];
+
+    for (jj = 0; jj < jnum; jj++) {
+      j = jlist[jj];
+      j &= NEIGHMASK;
+
+      if (!(mask[j] & groupbit)) continue;
+
+      // itag = jtag is possible for long cutoffs that include images of self
+
+      if (newton_pair == 0 && j >= nlocal) {
+        jtag = tag[j];
+        if (itag > jtag) {
+          if ((itag + jtag) % 2 == 0) continue;
+        } else if (itag < jtag) {
+          if ((itag + jtag) % 2 == 1) continue;
+        } else {
+          if (x[j][2] < ztmp) continue;
+          if (x[j][2] == ztmp) {
+            if (x[j][1] < ytmp) continue;
+            if (x[j][1] == ytmp && x[j][0] < xtmp) continue;
+          }
+        }
+      }
+
+      jtype = type[j];
+
+      if (type_filter)
+        if (type_filter[itype][jtype] == 0) continue;
+
+      delx = xtmp - x[j][0];
+      dely = ytmp - x[j][1];
+      delz = ztmp - x[j][2];
+      rsq = delx * delx + dely * dely + delz * delz;
+
+      if (cutstyle == TYPE) {
+        if (rsq >= cutsq[itype][jtype]) continue;
+      } else {
+        radsum = radius[i] + radius[j];
+        if (rsq >= radsum * radsum) continue;
+      }
+
+      r = sqrt(rsq);
+      rinv = 1.0 / r;
+      nx = delx * rinv;
+      ny = dely * rinv;
+      nz = delz * rinv;
+
+      phi_ij[0] += nx * nx;
+      phi_ij[1] += ny * ny;
+      phi_ij[2] += nz * nz;
+      phi_ij[3] += nx * ny;
+      phi_ij[4] += nx * nz;
+      phi_ij[5] += ny * nz;
+    }
+  }
+
+  //Sum phi across processors
+  MPI_Allreduce(phi_ij, temp_dbl, 6, MPI_DOUBLE, MPI_SUM, world);
+  for (i = 0; i < 6; i++) phi_ij[i] = temp_dbl[i] * ncinv;
+
+  trace_phi = (1.0 / 3.0) * (phi_ij[0] + phi_ij[1] + phi_ij[2]);
+
+  Ac_ij[0] = (15.0 / 2.0) * (phi_ij[0] - trace_phi);
+  Ac_ij[1] = (15.0 / 2.0) * (phi_ij[1] - trace_phi);
+  Ac_ij[2] = (15.0 / 2.0) * (phi_ij[2] - trace_phi);
+  Ac_ij[3] = (15.0 / 2.0) * (phi_ij[3]);
+  Ac_ij[4] = (15.0 / 2.0) * (phi_ij[4]);
+  Ac_ij[5] = (15.0 / 2.0) * (phi_ij[5]);
+
+  // If needed, loop through and calculate other fabric tensors
+  if (br_flag || fn_flag || ft_flag) {
+
+    for (ii = 0; ii < inum; ii++) {
+      i = ilist[ii];
+      if (!(mask[i] & groupbit)) continue;
+
+      xtmp = x[i][0];
+      ytmp = x[i][1];
+      ztmp = x[i][2];
+      itag = tag[i];
+      itype = type[i];
+      jlist = firstneigh[i];
+      jnum = numneigh[i];
+
+      for (jj = 0; jj < jnum; jj++) {
+        j = jlist[jj];
+        j &= NEIGHMASK;
+
+        if (!(mask[j] & groupbit)) continue;
+
+        // itag = jtag is possible for long cutoffs that include images of self
+
+        if (newton_pair == 0 && j >= nlocal) {
+          jtag = tag[j];
+          if (itag > jtag) {
+            if ((itag + jtag) % 2 == 0) continue;
+          } else if (itag < jtag) {
+            if ((itag + jtag) % 2 == 1) continue;
+          } else {
+            if (x[j][2] < ztmp) continue;
+            if (x[j][2] == ztmp) {
+              if (x[j][1] < ytmp) continue;
+              if (x[j][1] == ytmp && x[j][0] < xtmp) continue;
+            }
+          }
+        }
+
+        jtype = type[j];
+
+        if (type_filter)
+          if (type_filter[itype][jtype] == 0) continue;
+
+        delx = xtmp - x[j][0];
+        dely = ytmp - x[j][1];
+        delz = ztmp - x[j][2];
+        rsq = delx * delx + dely * dely + delz * delz;
+
+        if (cutstyle == TYPE) {
+          if (rsq >= cutsq[itype][jtype]) continue;
+        } else {
+          radsum = radius[i] + radius[j];
+          if (rsq >= radsum * radsum) continue;
+        }
+
+        if (fn_flag || ft_flag) eng = pair->single(i, j, itype, jtype, rsq, 1.0, 1.0, fpair);
+
+        r = sqrt(rsq);
+        rinv = 1.0 / r;
+        nx = delx * rinv;
+        ny = dely * rinv;
+        nz = delz * rinv;
+
+        denom = 1 + Ac_ij[0] * nx * nx + Ac_ij[1] * ny * ny + Ac_ij[2] * nz * nz;
+        denom += 2 * Ac_ij[3] * nx * ny + 2 * Ac_ij[4] * nx * nz + 2 * Ac_ij[5] * ny * nz;
+        prefactor = ncinv / denom;
+
+        if (br_flag) {
+          D_ij[0] += prefactor * nx * nx * r;
+          D_ij[1] += prefactor * ny * ny * r;
+          D_ij[2] += prefactor * nz * nz * r;
+          D_ij[3] += prefactor * nx * ny * r;
+          D_ij[4] += prefactor * nx * nz * r;
+          D_ij[5] += prefactor * ny * nz * r;
+        }
+
+        if (fn_flag || ft_flag) {
+          fn = r * fpair;
+
+          Xfn_ij[0] += prefactor * nx * nx * fn;
+          Xfn_ij[1] += prefactor * ny * ny * fn;
+          Xfn_ij[2] += prefactor * nz * nz * fn;
+          Xfn_ij[3] += prefactor * nx * ny * fn;
+          Xfn_ij[4] += prefactor * nx * nz * fn;
+          Xfn_ij[5] += prefactor * ny * nz * fn;
+
+          if (ft_flag) {
+            ft = force->pair->svector[3];
+
+            Xft_ij[0] += prefactor * nx * nx * ft;
+            Xft_ij[1] += prefactor * ny * ny * ft;
+            Xft_ij[2] += prefactor * nz * nz * ft;
+            Xft_ij[3] += prefactor * nx * ny * ft;
+            Xft_ij[4] += prefactor * nx * nz * ft;
+            Xft_ij[5] += prefactor * ny * nz * ft;
+          }
+        }
+      }
+    }
+  }
+
+  // Output results
+
+  if (cn_flag) {
+    for (i = 0; i < ntensors; i++) {
+      if (tensor_style[i] == CN) {
+        for (j = 0; j < 6; j++) vector[6 * i + j] = Ac_ij[j];
+      }
+    }
+  }
+
+  if (br_flag) {
+    MPI_Allreduce(D_ij, temp_dbl, 6, MPI_DOUBLE, MPI_SUM, world);
+    for (i = 0; i < 6; i++) D_ij[i] = temp_dbl[i];
+
+    trace_D = (1.0 / 3.0) * (D_ij[0] + D_ij[1] + D_ij[2]);
+
+    br_tensor[0] = (15.0 / (6.0 * trace_D)) * (D_ij[0] - trace_D);
+    br_tensor[1] = (15.0 / (6.0 * trace_D)) * (D_ij[1] - trace_D);
+    br_tensor[2] = (15.0 / (6.0 * trace_D)) * (D_ij[2] - trace_D);
+    br_tensor[3] = (15.0 / (6.0 * trace_D)) * (D_ij[3]);
+    br_tensor[4] = (15.0 / (6.0 * trace_D)) * (D_ij[4]);
+    br_tensor[5] = (15.0 / (6.0 * trace_D)) * (D_ij[5]);
+
+    for (i = 0; i < ntensors; i++) {
+      if (tensor_style[i] == BR) {
+        for (j = 0; j < 6; j++) vector[6 * i + j] = br_tensor[j];
+      }
+    }
+  }
+
+  if (fn_flag || ft_flag) {
+    MPI_Allreduce(Xfn_ij, temp_dbl, 6, MPI_DOUBLE, MPI_SUM, world);
+    for (i = 0; i < 6; i++) Xfn_ij[i] = temp_dbl[i];
+
+    trace_Xfn = (1.0 / 3.0) * (Xfn_ij[0] + Xfn_ij[1] + Xfn_ij[2]);
+  }
+
+  if (fn_flag) {
+
+    fn_tensor[0] = (15.0 / (6.0 * trace_Xfn)) * (Xfn_ij[0] - trace_Xfn);
+    fn_tensor[1] = (15.0 / (6.0 * trace_Xfn)) * (Xfn_ij[1] - trace_Xfn);
+    fn_tensor[2] = (15.0 / (6.0 * trace_Xfn)) * (Xfn_ij[2] - trace_Xfn);
+    fn_tensor[3] = (15.0 / (6.0 * trace_Xfn)) * (Xfn_ij[3]);
+    fn_tensor[4] = (15.0 / (6.0 * trace_Xfn)) * (Xfn_ij[4]);
+    fn_tensor[5] = (15.0 / (6.0 * trace_Xfn)) * (Xfn_ij[5]);
+
+    for (i = 0; i < ntensors; i++) {
+      if (tensor_style[i] == FN) {
+        for (j = 0; j < 6; j++) vector[6 * i + j] = fn_tensor[j];
+      }
+    }
+  }
+
+  if (ft_flag) {
+    MPI_Allreduce(Xft_ij, temp_dbl, 6, MPI_DOUBLE, MPI_SUM, world);
+    for (i = 0; i < 6; i++) Xft_ij[i] = temp_dbl[i];
+
+    trace_Xft = (1.0 / 3.0) * (Xft_ij[0] + Xft_ij[1] + Xft_ij[2]);
+
+    ft_tensor[0] = (15.0 / (9.0 * trace_Xfn)) * (Xft_ij[0] - trace_Xft);
+    ft_tensor[1] = (15.0 / (9.0 * trace_Xfn)) * (Xft_ij[1] - trace_Xft);
+    ft_tensor[2] = (15.0 / (9.0 * trace_Xfn)) * (Xft_ij[2] - trace_Xft);
+    ft_tensor[3] = (15.0 / (9.0 * trace_Xfn)) * (Xft_ij[3]);
+    ft_tensor[4] = (15.0 / (9.0 * trace_Xfn)) * (Xft_ij[4]);
+    ft_tensor[5] = (15.0 / (9.0 * trace_Xfn)) * (Xft_ij[5]);
+
+    for (i = 0; i < ntensors; i++) {
+      if (tensor_style[i] == FT) {
+        for (j = 0; j < 6; j++) vector[6 * i + j] = ft_tensor[j];
+      }
+    }
+  }
+}
+
+/* ---------------------------------------------------------------------- */
+
+double ComputeFabric::compute_scalar()
+{
+  // Skip if already calculated on this timestep
+  if (invoked_scalar == update->ntimestep) return nc;
+
+  invoked_scalar = update->ntimestep;
+
+  int i, j, ii, jj, inum, jnum, itype, jtype;
+  tagint itag, jtag;
+  double xtmp, ytmp, ztmp, delx, dely, delz;
+  double rsq, radsum, temp_dbl;
+
+  int *ilist, *jlist, *numneigh, **firstneigh;
+
+  double **x = atom->x;
+  double *radius = atom->radius;
+  tagint *tag = atom->tag;
+  int *type = atom->type;
+  int *mask = atom->mask;
+  int nlocal = atom->nlocal;
+  int newton_pair = force->newton_pair;
+
+  // invoke half neighbor list (will copy or build if necessary)
+  neighbor->build_one(list);
+
+  inum = list->inum;
+  ilist = list->ilist;
+  numneigh = list->numneigh;
+  firstneigh = list->firstneigh;
+
+  double **cutsq = force->pair->cutsq;
+
+  // First loop through and calculate contact fabric tensor
+  nc = 0;
+  for (ii = 0; ii < inum; ii++) {
+    i = ilist[ii];
+    if (!(mask[i] & groupbit)) continue;
+
+    xtmp = x[i][0];
+    ytmp = x[i][1];
+    ztmp = x[i][2];
+    itag = tag[i];
+    itype = type[i];
+    jlist = firstneigh[i];
+    jnum = numneigh[i];
+
+    for (jj = 0; jj < jnum; jj++) {
+      j = jlist[jj];
+      j &= NEIGHMASK;
+
+      if (!(mask[j] & groupbit)) continue;
+
+      // itag = jtag is possible for long cutoffs that include images of self
+
+      if (newton_pair == 0 && j >= nlocal) {
+        jtag = tag[j];
+        if (itag > jtag) {
+          if ((itag + jtag) % 2 == 0) continue;
+        } else if (itag < jtag) {
+          if ((itag + jtag) % 2 == 1) continue;
+        } else {
+          if (x[j][2] < ztmp) continue;
+          if (x[j][2] == ztmp) {
+            if (x[j][1] < ytmp) continue;
+            if (x[j][1] == ytmp && x[j][0] < xtmp) continue;
+          }
+        }
+      }
+
+      jtype = type[j];
+
+      if (type_filter)
+        if (type_filter[itype][jtype] == 0) continue;
+
+      delx = xtmp - x[j][0];
+      dely = ytmp - x[j][1];
+      delz = ztmp - x[j][2];
+      rsq = delx * delx + dely * dely + delz * delz;
+
+      if (cutstyle == TYPE) {
+        if (rsq >= cutsq[itype][jtype]) continue;
+      } else {
+        radsum = radius[i] + radius[j];
+        if (rsq >= radsum * radsum) continue;
+      }
+
+      nc += 1.0;
+    }
+  }
+  //Count total contacts across processors
+  MPI_Allreduce(&nc, &temp_dbl, 1, MPI_DOUBLE, MPI_SUM, world);
+  nc = temp_dbl;
+
+  scalar = nc;
+  return nc;
+}
+
--- a/src/GRANULAR/compute_fabric.h
+++ b/src/GRANULAR/compute_fabric.h
@ -0,0 +1,79 @@
+/* -*- c++ -*- ----------------------------------------------------------
+   LAMMPS - Large-scale Atomic/Molecular Massively Parallel Simulator
+   https://www.lammps.org/, Sandia National Laboratories
+   Steve Plimpton, sjplimp@sandia.gov
+
+   Copyright (2003) Sandia Corporation.  Under the terms of Contract
+   DE-AC04-94AL85000 with Sandia Corporation, the U.S. Government retains
+   certain rights in this software.  This software is distributed under
+   the GNU General Public License.
+
+   See the README file in the top-level LAMMPS directory.
+------------------------------------------------------------------------- */
+
+#ifdef COMPUTE_CLASS
+// clang-format off
+ComputeStyle(fabric,ComputeFabric);
+// clang-format on
+#else
+
+#ifndef LMP_COMPUTE_FABRIC_H
+#define LMP_COMPUTE_FABRIC_H
+
+#include "compute.h"
+
+namespace LAMMPS_NS {
+
+class ComputeFabric : public Compute {
+ public:
+  ComputeFabric(class LAMMPS *, int, char **);
+  ~ComputeFabric();
+  void init();
+  void init_list(int, class NeighList *);
+  void compute_vector();
+  double compute_scalar();
+
+ private:
+  int ntensors, pstyle, cutstyle;
+  double nc;
+  int *tensor_style;
+  int **type_filter;
+  class NeighList *list;
+
+  int cn_flag, br_flag, fn_flag, ft_flag;
+};
+
+}    // namespace LAMMPS_NS
+
+#endif
+#endif
+
+/* ERROR/WARNING messages:
+
+E: Illegal ... command
+
+Self-explanatory.  Check the input script syntax and compare to the
+documentation for the command.  You can use -echo screen as a
+command-line option when running LAMMPS to see the offending line.
+
+E: Compute fabric radius style requires atom attribute radius
+
+Self-explanatory.
+
+E: No pair style is defined for compute fabric
+
+Self-explanatory.
+
+E: Pair style does not support compute fabric normal or tangential force
+
+Pair style must be single enabled to calculate the normal or tangential force tensors
+
+E: Pair style does not calculate tangential forces for compute fabric
+
+The tangential force tensor can only be calculated for granular pair styles with tangential forces
+
+E: Compute fabric does not support pair styles that extend beyond contact
+
+Granular pair styles that extend beyond contact such as JKR or DMT are not supported
+
+*/
--- a/src/KOKKOS/kokkos.cpp
+++ b/src/KOKKOS/kokkos.cpp
@ -69,6 +69,10 @@ GPU_AWARE_UNKNOWN

 using namespace LAMMPS_NS;

+Kokkos::InitArguments KokkosLMP::args{-1, -1, -1, false};
+int KokkosLMP::is_finalized = 0;
+int KokkosLMP::init_ngpus = 0;
+
 /* ---------------------------------------------------------------------- */

 KokkosLMP::KokkosLMP(LAMMPS *lmp, int narg, char **arg) : Pointers(lmp)
@ -155,6 +159,10 @@ KokkosLMP::KokkosLMP(LAMMPS *lmp, int narg, char **arg) : Pointers(lmp)
    } else if (strcmp(arg[iarg],"t") == 0 ||
               strcmp(arg[iarg],"threads") == 0) {
      nthreads = atoi(arg[iarg+1]);
+
+      if (nthreads <= 0)
+        error->all(FLERR,"Invalid number of threads requested for Kokkos: must be 1 or greater");
+
      iarg += 2;

    } else if (strcmp(arg[iarg],"n") == 0 ||
@ -165,13 +173,27 @@ KokkosLMP::KokkosLMP(LAMMPS *lmp, int narg, char **arg) : Pointers(lmp)
    } else error->all(FLERR,"Invalid Kokkos command-line args");
  }

-  // initialize Kokkos
+  // Initialize Kokkos. However, we cannot change any
+  // Kokkos library parameters after the first initalization

-  if (me == 0) {
-    if (screen) fprintf(screen,"  will use up to %d GPU(s) per node\n",ngpus);
-    if (logfile) fprintf(logfile,"  will use up to %d GPU(s) per node\n",ngpus);
+  if (args.num_threads != -1) {
+    if (args.num_threads != nthreads || args.num_numa != numa || args.device_id != device)
+      if (me == 0)
+        error->warning(FLERR,"Kokkos package already initalized, cannot reinitialize with different parameters");
+    nthreads = args.num_threads;
+    numa = args.num_numa;
+    device = args.device_id;
+    ngpus = init_ngpus;
+  } else {
+    args.num_threads = nthreads;
+    args.num_numa = numa;
+    args.device_id = device;
+    init_ngpus = ngpus;
  }

+  if (me == 0)
+    utils::logmesg(lmp, "  will use up to {} GPU(s) per node\n",ngpus);
+
 #ifdef LMP_KOKKOS_GPU
  if (ngpus <= 0)
    error->all(FLERR,"Kokkos has been compiled for CUDA, HIP, or SYCL but no GPUs are requested");
@ -184,12 +206,7 @@ KokkosLMP::KokkosLMP(LAMMPS *lmp, int narg, char **arg) : Pointers(lmp)
                         "than the OpenMP backend");
 #endif

-  Kokkos::InitArguments args;
-  args.num_threads = nthreads;
-  args.num_numa = numa;
-  args.device_id = device;
-
-  Kokkos::initialize(args);
+  KokkosLMP::initialize(args,error);

  // default settings for package kokkos command

@ -299,9 +316,27 @@ KokkosLMP::KokkosLMP(LAMMPS *lmp, int narg, char **arg) : Pointers(lmp)

 KokkosLMP::~KokkosLMP()
 {
-  // finalize Kokkos

+}
+
+/* ---------------------------------------------------------------------- */
+
+void KokkosLMP::initialize(Kokkos::InitArguments args, Error *error)
+{
+  if (!Kokkos::is_initialized()) {
+    if (is_finalized)
+      error->all(FLERR,"Kokkos package already finalized, cannot re-initialize");
+    Kokkos::initialize(args);
+  }
+}
+
+/* ---------------------------------------------------------------------- */
+
+void KokkosLMP::finalize()
+{
+  if (Kokkos::is_initialized() && !is_finalized)
    Kokkos::finalize();
+  is_finalized = 1;
 }

 /* ----------------------------------------------------------------------
--- a/src/KOKKOS/kokkos.h
+++ b/src/KOKKOS/kokkos.h
@ -49,8 +49,14 @@ class KokkosLMP : protected Pointers {
  int newtonflag;
  double binsize;

+  static int is_finalized;
+  static Kokkos::InitArguments args;
+  static int init_ngpus;
+
  KokkosLMP(class LAMMPS *, int, char **);
  ~KokkosLMP();
+  static void initialize(Kokkos::InitArguments, Error *);
+  static void finalize();
  void accelerator(int, char **);
  int neigh_count(int);

@ -84,13 +90,21 @@ because MPI library not recognized

 The local MPI rank was not found in one of four supported environment variables.

+E: Invalid number of threads requested for Kokkos: must be 1 or greater
+
+Self-explanatory.
+
 E: GPUs are requested but Kokkos has not been compiled for CUDA

 Recompile Kokkos with CUDA support to use GPUs.

-E: Kokkos has been compiled for CUDA but no GPUs are requested
+E: Kokkos has been compiled for CUDA, HIP, or SYCL but no GPUs are requested

-One or more GPUs must be used when Kokkos is compiled for CUDA.
+One or more GPUs must be used when Kokkos is compiled for CUDA/HIP/SYCL.
+
+W: Kokkos package already initalized, cannot reinitialize with different parameters
+
+Self-explanatory.

 E: Illegal ... command

--- a/src/SPIN/fix_nve_spin.cpp
+++ b/src/SPIN/fix_nve_spin.cpp
@ -176,6 +176,7 @@ void FixNVESpin::init()

  // loop 1: obtain # of Pairs, and # of Pair/Spin styles

+  npairspin = 0;
  PairHybrid *hybrid = (PairHybrid *)force->pair_match("^hybrid",0);
  if (force->pair_match("^spin",0,0)) {        // only one Pair/Spin style
    pair = force->pair_match("^spin",0,0);
@ -232,6 +233,7 @@ void FixNVESpin::init()
  // loop 1: obtain # of fix precession/spin styles

  int iforce;
+  nprecspin = 0;
  for (iforce = 0; iforce < modify->nfix; iforce++) {
    if (utils::strmatch(modify->fix[iforce]->style,"^precession/spin")) {
      nprecspin++;
@ -264,6 +266,7 @@ void FixNVESpin::init()

  // loop 1: obtain # of fix langevin/spin styles

+  nlangspin = 0;
  for (iforce = 0; iforce < modify->nfix; iforce++) {
    if (utils::strmatch(modify->fix[iforce]->style,"^langevin/spin")) {
      nlangspin++;
--- a/src/accelerator_kokkos.h
+++ b/src/accelerator_kokkos.h
@ -56,16 +56,12 @@ class KokkosLMP {

  KokkosLMP(class LAMMPS *, int, char **) { kokkos_exists = 0; }
  ~KokkosLMP() {}
+  static void finalize() {}
  void accelerator(int, char **) {}
  int neigh_list_kokkos(int) { return 0; }
  int neigh_count(int) { return 0; }
 };

-class Kokkos {
- public:
-  static void finalize() {}
-};
-
 class AtomKokkos : public Atom {
 public:
  tagint **k_special;
--- a/src/error.cpp
+++ b/src/error.cpp
@ -81,7 +81,7 @@ void Error::universe_all(const std::string &file, int line, const std::string &s

  throw LAMMPSException(mesg);
 #else
-  if (lmp->kokkos) Kokkos::finalize();
+  KokkosLMP::finalize();
  MPI_Finalize();
  exit(1);
 #endif
@ -107,6 +107,7 @@ void Error::universe_one(const std::string &file, int line, const std::string &s

  throw LAMMPSAbortException(mesg, universe->uworld);
 #else
+  KokkosLMP::finalize();
  MPI_Abort(universe->uworld,1);
  exit(1); // to trick "smart" compilers into believing this does not return
 #endif
@ -173,8 +174,8 @@ void Error::all(const std::string &file, int line, const std::string &str)
  if (screen && screen != stdout) fclose(screen);
  if (logfile) fclose(logfile);

+  KokkosLMP::finalize();
  if (universe->nworlds > 1) MPI_Abort(universe->uworld,1);
-  if (lmp->kokkos) Kokkos::finalize();
  MPI_Finalize();
  exit(1);
 #endif
@ -213,6 +214,7 @@ void Error::one(const std::string &file, int line, const std::string &str)
 #else
  if (screen) fflush(screen);
  if (logfile) fflush(logfile);
+  KokkosLMP::finalize();
  MPI_Abort(world,1);
  exit(1); // to trick "smart" compilers into believing this does not return
 #endif
@ -315,7 +317,7 @@ void Error::done(int status)
  if (screen && screen != stdout) fclose(screen);
  if (logfile) fclose(logfile);

-  if (lmp->kokkos) Kokkos::finalize();
+  KokkosLMP::finalize();
  MPI_Finalize();
  exit(status);
 }
--- a/src/library.cpp
+++ b/src/library.cpp
@ -19,6 +19,7 @@
 #include "library.h"
 #include <mpi.h>

+#include "accelerator_kokkos.h"
 #include "atom.h"
 #include "atom_vec.h"
 #include "comm.h"
@ -333,8 +334,8 @@ The MPI standard requires that any MPI application calls
 do any MPI calls, MPI is still initialized internally to avoid errors
 accessing any MPI functions.  This function should then be called right
 before exiting the program to wait until all (parallel) tasks are
-completed and then MPI is cleanly shut down.  After this function no
-more MPI calls may be made.
+completed and then MPI is cleanly shut down.  After calling this
+function no more MPI calls may be made.

 .. versionadded:: 18Sep2020

@ -353,6 +354,28 @@ void lammps_mpi_finalize()
  }
 }

+/* ---------------------------------------------------------------------- */
+
+/** Shut down the Kokkos library environment.
+ *
+\verbatim embed:rst
+
+The Kokkos library may only be initialized once during the execution of
+a process.  This is done automatically the first time Kokkos
+functionality is used.  This requires that the Kokkos environment
+must be explicitly shut down after any LAMMPS instance using it is
+closed (to release associated resources).
+After calling this function no Kokkos functionality may be used.
+
+.. versionadded:: TBD
+
+\endverbatim */
+
+void lammps_kokkos_finalize()
+{
+  KokkosLMP::finalize();
+}
+
 // ----------------------------------------------------------------------
 // Library functions to process commands
 // ----------------------------------------------------------------------
--- a/src/library.h
+++ b/src/library.h
@ -94,6 +94,7 @@ void lammps_close(void *handle);

 void lammps_mpi_init();
 void lammps_mpi_finalize();
+void lammps_kokkos_finalize();

 /* ----------------------------------------------------------------------
 * Library functions to process commands
--- a/src/main.cpp
+++ b/src/main.cpp
@ -14,6 +14,7 @@
 #include "lammps.h"

 #include "input.h"
+#include "accelerator_kokkos.h"
 #if defined(LAMMPS_EXCEPTIONS)
 #include "exceptions.h"
 #endif
@ -77,13 +78,16 @@ int main(int argc, char **argv)
    lammps->input->file();
    delete lammps;
  } catch (LAMMPSAbortException &ae) {
+    KokkosLMP::finalize();
    MPI_Abort(ae.universe, 1);
  } catch (LAMMPSException &e) {
+    KokkosLMP::finalize();
    MPI_Barrier(lammps_comm);
    MPI_Finalize();
    exit(1);
  } catch (fmt::format_error &fe) {
    fprintf(stderr, "fmt::format_error: %s\n", fe.what());
+    KokkosLMP::finalize();
    MPI_Abort(MPI_COMM_WORLD, 1);
    exit(1);
  }
@ -94,10 +98,12 @@ int main(int argc, char **argv)
    delete lammps;
  } catch (fmt::format_error &fe) {
    fprintf(stderr, "fmt::format_error: %s\n", fe.what());
+    KokkosLMP::finalize();
    MPI_Abort(MPI_COMM_WORLD, 1);
    exit(1);
  }
 #endif
+  KokkosLMP::finalize();
  MPI_Barrier(lammps_comm);
  MPI_Finalize();
 }
--- a/tools/swig/lammps.i
+++ b/tools/swig/lammps.i
@ -63,6 +63,7 @@ extern void  *lammps_open_fortran(int argc, char **argv, int f_comm);
 extern void   lammps_close(void *handle);
 extern void   lammps_mpi_init();
 extern void   lammps_mpi_finalize();
+extern void   lammps_kokkos_finalize();
 extern void   lammps_file(void *handle, const char *file);
 extern char  *lammps_command(void *handle, const char *cmd);
 extern void   lammps_commands_list(void *handle, int ncmd, const char **cmds);
@ -185,6 +186,7 @@ extern void  *lammps_open_fortran(int argc, char **argv, int f_comm);
 extern void   lammps_close(void *handle);
 extern void   lammps_mpi_init();
 extern void   lammps_mpi_finalize();
+extern void   lammps_kokkos_finalize();
 extern void   lammps_file(void *handle, const char *file);
 extern char  *lammps_command(void *handle, const char *cmd);
 extern void   lammps_commands_list(void *handle, int ncmd, const char **cmds);
--- a/unittest/python/python-capabilities.py
+++ b/unittest/python/python-capabilities.py
@ -165,6 +165,13 @@ class PythonCapabilities(unittest.TestCase):
            if self.cmake_cache['GPU_PREC'].lower() == 'single':
                 self.assertIn('single',settings['GPU']['precision'])

+        if self.cmake_cache['PKG_KOKKOS']:
+            if self.cmake_cache['Kokkos_ENABLE_OPENMP']:
+                self.assertIn('openmp',settings['KOKKOS']['api'])
+            if self.cmake_cache['Kokkos_ENABLE_SERIAL']:
+                self.assertIn('serial',settings['KOKKOS']['api'])
+            self.assertIn('double',settings['KOKKOS']['precision'])
+
    def test_gpu_device(self):

        info = self.lmp.get_gpu_device_info()