6fe16c7606
Added pair style sph/lj/gpu in src/GPU, removed commented codes
2023-12-07 14:33:47 -06:00
4244d2e6cd
silence compiler warnings about unused parameters and variables
2023-01-19 08:56:54 -05:00
a3cc0e8432
Reverted the block size tuning, which caused bugs for low atom counts (will revisit later)
2022-11-04 13:45:59 -05:00
00f46120c7
Removed max_cus() from Device, used device->gpu->cus() instead
2022-10-07 15:50:30 -05:00
6b9e83fe20
Added timing for the induced dipole spreading part, computed the block size to ensure all the CUs are occupied by the fphi_uind and fphi_mpole kernels
2022-10-06 15:03:58 -05:00
9a1f23a079
Cosmetic changes and cleanup
2022-09-30 17:32:25 -05:00
1d75ca3b20
Moved precompute() out of the terms in amoeba and hippo, to be involed in the first term in a time step: multipole for amoeba and repulsion for hippo
2022-09-30 16:31:13 -05:00
785131932c
Added fphi_mpole in amoeba/gpu, fixed a bug in the kernel when indexing grid
2022-09-20 13:58:17 -05:00
62ecf98cda
Enabled fphi_uind in hippo/gpu, really need to refactor hippo and amoeba in the GPU lib to remove kernel duplicates
2022-09-16 14:47:16 -05:00
880f20c285
Cleaned up kernels
2022-09-15 15:29:14 -05:00
cad7e1b364
Moved fphi_uind up to BaseAmoeba
2022-09-02 10:18:59 -05:00
aac264f2e2
Working on the fphi_uind kernel and array allocations
2022-08-30 23:40:04 -05:00
9e7bbad4d4
Working on fphi_uind in the GPU lib
2022-08-27 13:19:52 -05:00
0f0f6a51de
Renamed sp_polar to sp_amoeba, and replaced special_wscale with special_hal for amoeba
2021-10-02 16:02:44 -05:00
3328ac0df2
Attempted to remove some redundancy in data transfers in the amoeba kernels; keeping HIPPO independent of AMOEBA for now
2021-10-01 09:58:21 -05:00
b874feb127
Removed trailing spaces
2021-09-28 17:28:33 -05:00
d77d5b7f0a
Added classes for hippo/gpu, refactored BaseAmoeba and made room for the dispersion real-space term in hippo
2021-09-21 15:40:06 -05:00
a2fd784034
Added the dispersion real space term, which is for HIPPO.
2021-09-21 10:55:38 -05:00
0228867d8e
Added the dispersion real space kernel and transfer special coeffs to the device
2021-09-19 23:40:43 -05:00
1166845fcf
Prepared data structure for the dispersion real-space term
2021-09-18 10:22:22 -05:00
5d801e985f
More cleanup
2021-09-17 23:24:23 -05:00
2e6df83b9b
Fixed bugs in the multipole real-space part on the GPU; separately multipole real and polar real work correctly (along with udirect2b and umutual2b), but
...
together they are conflicting due to the use of ans to copy forces back from device to host. The other 2 kernels (induce part) do not touch forces and energies.
2021-09-17 15:24:36 -05:00
d926705950
Short neighbor list for multipole real-space should be built with off2_mpole
2021-09-17 01:32:00 -05:00
003bebd31e
Working on the multipole real-space term, not ready yet
2021-09-17 01:19:33 -05:00
c0b967054e
Fixed bugs with zero local atoms (similar to what has been done to PPPM interp)
2021-09-16 17:27:44 -05:00
98c1a0178c
Refactored the API so that different off2 values are used for different kernels
2021-09-16 17:14:36 -05:00
a21095fded
More cleaning up
2021-09-13 13:47:15 -05:00
c765861851
Cleaned up and re-arranged the functions to reflect the order of calling in a time step
2021-09-11 01:00:58 -05:00
7f5a82dc54
Switched to the short neighbor list implementation in the pre-10Feb21 version (the recent version enforces tpa = 1 for short nbor)
2021-09-11 00:34:43 -05:00
4ebe5833d3
Working on short nbor list for the amoeba kernels (based on what has been done with tersoff and ellipsod, nbor dev_packed needs to be allocated properly)
2021-09-10 16:51:16 -05:00
efe0bf593f
Adding the umutual2b kernel, need to create another array for tdipdip on the GPU
2021-09-09 15:19:43 -05:00
7d69a870a4
Reverted the binsize function call from the GPU package in Atom, instead added atom_modify sort with a binsize to ensure matching virial values, enabled the udirect2b kernel, need more work to override dfield0c, and induce() to bypass reverse_comm() for field and fieldp (line amoeba_induce.cpp:111-112)
2021-09-03 13:43:22 -05:00
785a794d39
Added and renamed API to make room for additional kernels (udirect2b only computes the field and fieldp, not accumulating forces, energies, nor virials)
2021-09-01 14:37:11 -05:00
07b60827c4
Working on the udirect2b kernel for the induce real space term, need to add the API for the GPU library
2021-09-01 12:30:41 -05:00
3825fee8e9
Added work on amoeba/gpu, some minor changes to PairAmoeba to allow function overriding in PairAmoebaGPU, added the package AMOEBA to cmake/CMakeLists.txt
2021-08-25 22:57:37 -05:00