No description
Find a file
2025-06-07 07:51:32 +02:00
fortran add copyright headers 2019-12-06 21:21:20 +01:00
libsharp2 name change to libsharp2 2019-12-06 13:53:27 +01:00
m4 Add auto code 2025-06-07 07:51:32 +02:00
python update library name 2019-12-06 14:27:56 +01:00
test fix typo 2022-09-29 14:59:32 +02:00
.gitignore Add further tools 2025-06-07 07:51:32 +02:00
aclocal.m4 Add auto code 2025-06-07 07:51:32 +02:00
ar-lib Add further tools 2025-06-07 07:51:32 +02:00
compile Add further tools 2025-06-07 07:51:32 +02:00
COMPILE name change to libsharp2 2019-12-06 13:53:27 +01:00
config.guess Update config.sub/guess 2025-06-07 07:51:32 +02:00
config.sub Update config.sub/guess 2025-06-07 07:51:32 +02:00
configure Add auto code 2025-06-07 07:51:32 +02:00
configure.ac update configure.ac 2022-05-22 09:38:27 +02:00
COPYING initial import 2012-06-27 16:44:31 +02:00
depcomp Add auto code 2025-06-07 07:51:32 +02:00
INSTALL Add auto code 2025-06-07 07:51:32 +02:00
install-sh Hack manually to use /bin/sh for all OS 2025-06-07 07:51:32 +02:00
ltmain.sh Add auto code 2025-06-07 07:51:32 +02:00
Makefile.am tweak libraries 2019-12-06 14:08:11 +01:00
Makefile.in Add auto code 2025-06-07 07:51:32 +02:00
missing Add auto code 2025-06-07 07:51:32 +02:00
README.md Add deprecation notice 2021-05-20 12:38:51 +00:00
runtest.sh name change to libsharp2 2019-12-06 13:53:27 +01:00
test-driver Add auto code 2025-06-07 07:51:32 +02:00

NOTICE

Active development of this package has stopped. The package will receive bug fixes if necessary, but otherwise the code has been integrated into the ducc0 package (https://gitlab.mpcdf.mpg.de/mtr/ducc), and further development is taking place there.

Please prefer ducc0 over libsharp2 if you are starting a new project!

Libsharp2

Library for efficient spherical harmonic transforms at arbitrary spins, supporting CPU vectorization, OpenMP and MPI.

Paper

https://arxiv.org/abs/1303.4945

News

January 2019

This update features significant speedups thanks to important algorithmic discoveries by Keiichi Ishioka (https://www.jstage.jst.go.jp/article/jmsj/96/2/96_2018-019/_article and personal communication).

These improvements reduce the fraction of CPU time spent on evaluating the recurrences for Y_lm coefficients, which means that computing multiple simultaneous SHTs no longer has a big performance advantage compared to SHTs done one after the other. As a consequence, libsharp's support for simultaneous SHTs was dropped, making its interface much simpler.

With the proper compilers and flags (see the file COMPILE for details) libsharp2 is now built with support for SSE2, AVX, AVX2, FMA3, FMA4 and AVX512f and the appropriate implementation is selected dynamically at runtime. This should provide a very significant performance boost for everyone using pre-compiled portable binaries.

Compilation

The library uses the standard autotools mechanism for configuration, compilation and installation. See the file COMPILE for configuration hints.