|
|
00f46120c7
|
Removed max_cus() from Device, used device->gpu->cus() instead
|
2022-10-07 15:50:30 -05:00 |
|
|
|
6b9e83fe20
|
Added timing for the induced dipole spreading part, computed the block size to ensure all the CUs are occupied by the fphi_uind and fphi_mpole kernels
|
2022-10-06 15:03:58 -05:00 |
|
|
|
9a1f23a079
|
Cosmetic changes and cleanup
|
2022-09-30 17:32:25 -05:00 |
|
|
|
1d75ca3b20
|
Moved precompute() out of the terms in amoeba and hippo, to be involed in the first term in a time step: multipole for amoeba and repulsion for hippo
|
2022-09-30 16:31:13 -05:00 |
|
|
|
785131932c
|
Added fphi_mpole in amoeba/gpu, fixed a bug in the kernel when indexing grid
|
2022-09-20 13:58:17 -05:00 |
|
|
|
62ecf98cda
|
Enabled fphi_uind in hippo/gpu, really need to refactor hippo and amoeba in the GPU lib to remove kernel duplicates
|
2022-09-16 14:47:16 -05:00 |
|
|
|
cd3a00c2c4
|
Added timing breakdown for fphi_uind
|
2022-09-14 15:28:44 -05:00 |
|
|
|
9e7bbad4d4
|
Working on fphi_uind in the GPU lib
|
2022-08-27 13:19:52 -05:00 |
|
|
|
66ee2bf989
|
Cleaned up
|
2022-07-14 11:01:30 -05:00 |
|
|
|
0c44bd1086
|
Rearranged the order of real-space and kspace part of ufield0c(), delayed device-host transfer from umutual2b() to overlap with kspace part
|
2022-07-08 14:45:31 -05:00 |
|
|
|
79fbbd4f33
|
Cleaned up the API of amoeba and hippo to remove unncessary arguments
|
2021-10-04 14:40:58 -05:00 |
|
|
|
5a6426bf96
|
Only transfer data arrays that are needed in each kernel
|
2021-10-02 00:56:15 -05:00 |
|
|
|
f4d3d3a2b5
|
Gradually cleaned up and removed redundancy in amoeba and hippo
|
2021-10-02 00:09:53 -05:00 |
|
|
|
3328ac0df2
|
Attempted to remove some redundancy in data transfers in the amoeba kernels; keeping HIPPO independent of AMOEBA for now
|
2021-10-01 09:58:21 -05:00 |
|
|
|
ad9d45639e
|
Fixed bugs with damprep where ucl_powr in mixed precision failed with a negative single-reprecision base
|
2021-09-29 12:32:08 -05:00 |
|
|
|
4be44c386f
|
Added necessary arguments to the hippo repulsion kernel
|
2021-09-29 09:40:33 -05:00 |
|
|
|
17edd797a7
|
Adding API for the repulsion term to hippo/gpu
|
2021-09-28 23:42:04 -05:00 |
|
|
|
b95508125b
|
Adding the repulsion kernel for hippo
|
2021-09-28 23:24:34 -05:00 |
|
|
|
6286a119b3
|
Removed precompute() in hippo
|
2021-09-28 23:12:07 -05:00 |
|
|
|
b874feb127
|
Removed trailing spaces
|
2021-09-28 17:28:33 -05:00 |
|
|
|
e80eea56ba
|
Added udirect2b and umutual2b for hippo
|
2021-09-28 14:59:39 -05:00 |
|
|
|
d27836952a
|
Fixed a bug in neighbor.cpp to make special_flag consistent between amoeba and hippo (to be 2 instead of 0), that caused missing neighbors with hippo
|
2021-09-27 16:12:49 -05:00 |
|
|
|
2efd841a7e
|
Trying to find the difference in the neighbor list build in hippo vs amoeba
|
2021-09-27 11:35:35 -05:00 |
|
|
|
7437c98628
|
Fixed bugs in the polar real kernel in hippo, getting closer..
|
2021-09-26 09:11:09 -05:00 |
|
|
|
5193dcf8c5
|
Working on the polar real-space term of hippo
|
2021-09-26 00:56:29 -05:00 |
|
|
|
edbed9c9c9
|
Fixed bugs in HippoT::compute_dispersion_real and compute_multipole_real to ensure that answers only get copied back from device in the last kernel activated.
|
2021-09-26 00:13:40 -05:00 |
|
|
|
78ef0d631f
|
Working on the multipole real-space term of hippo
|
2021-09-25 12:25:34 -05:00 |
|
|
|
e77df80ce2
|
Working hippo multipole real-space term, added helper functions in a separate file
|
2021-09-24 16:44:43 -05:00 |
|
|
|
ad8164dfc0
|
Fixed bugs in the dispersion real-space term for hippo. NOTE: CPU version filter out neighbors with zero special_disp
|
2021-09-24 00:21:25 -05:00 |
|
|
|
d77d5b7f0a
|
Added classes for hippo/gpu, refactored BaseAmoeba and made room for the dispersion real-space term in hippo
|
2021-09-21 15:40:06 -05:00 |
|