No description
Find a file
2019-06-13 13:42:26 +02:00
fortran prepare simple Fortran interface 2019-05-07 13:56:08 +02:00
libsharp reorganization 2019-02-28 10:29:56 +01:00
m4 step 1 2018-10-15 17:26:23 +02:00
pocketfft prepare simple Fortran interface 2019-05-07 13:56:08 +02:00
python add experimental Python interface 2019-06-13 13:42:26 +02:00
test reorganization 2019-02-28 10:29:56 +01:00
.gitignore tweaks 2019-01-11 17:25:21 +01:00
COMPILE cosmetics 2019-05-07 14:10:44 +02:00
configure.ac prepare simple Fortran interface 2019-05-07 13:56:08 +02:00
COPYING initial import 2012-06-27 16:44:31 +02:00
Makefile.am add MPI-specific files 2019-05-31 22:14:03 +02:00
README.md typo 2019-01-18 09:35:07 +01:00
runtest.sh add test 2019-01-12 12:56:27 +01:00

Libsharp

Library for efficient spherical harmonic transforms at arbitrary spins, supporting CPU vectorization, OpenMP and MPI.

Paper

https://arxiv.org/abs/1303.4945

News

January 2019

This update features significant speedups thanks to important algorithmic discoveries by Keiichi Ishioka (https://www.jstage.jst.go.jp/article/jmsj/96/2/96_2018-019/_article and personal communication).

These improvements reduce the fraction of CPU time spent on evaluating the recurrences for Y_lm coefficients, which means that computing multiple simultaneous SHTs no longer has a big performance advantage compared to SHTs done one after the other. As a consequence, libsharp support for simultaneous SHTs was dropped, making its interface much simpler.

With the proper compilers and flags (see the file COMPILE for details) libsharp is now built with support for SSE2, AVX, AVX2, FMA3, FMA4 and AVX512f and the appropriate implementation is selected dynamically at runtime. This should provide a very significant performance boost for everyone using pre-compiled portable binaries.

Compilation

The library uses the standard autotools mechanism for configuration, compilation and installation. See the file COMPILE for configuration hints.