Optimized SPADE analysis #419

Merged: 36 commits merged into NeuralEnsemble:master from INM-6:enh/accelerated_spade on Aug 13, 2021

Conversation

@mdenker (Member) commented May 14, 2021

This PR includes a new version of the FP-growth algorithm for the SPADE analysis, based on the work of Florian Porrmann (https://github.com/fporrmann/FPG). The new library features a number of SPADE-specific optimizations that significantly improve run time.
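
For reference, the user-facing call is unchanged; only the pattern-mining backend is swapped. A minimal usage sketch (parameter values are arbitrary and chosen only for illustration):

import quantities as pq
from elephant import spade
from elephant.spike_train_generation import homogeneous_poisson_process

# Two short Poisson spike trains, just to have something to mine.
spiketrains = [homogeneous_poisson_process(rate=20 * pq.Hz, t_stop=1000 * pq.ms)
               for _ in range(2)]

# SPADE transparently uses the compiled fim extension for pattern mining;
# the signature of spade.spade() itself is not affected by this PR.
result = spade.spade(spiketrains, bin_size=5 * pq.ms, winlen=10,
                     min_spikes=2, min_occ=2, n_surr=0)
print(result['patterns'])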

In order to include the C++ module, the install procedure was altered to compile the corresponding fim.so from scratch, rather than downloading a precompiled module post-install. In addition, recipes for building pip wheels for Linux and Windows were added as GitHub Actions based on cibuildwheel. Currently, these are configured to run on every push; after a testing period, this may be changed to build only on a PR, or even only on a version-tagged PR.
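
For orientation, the build now declares the C++ sources as a setuptools extension, roughly along the lines of the following sketch (source path and compile flags are illustrative; see setup.py in this PR for the actual configuration):

from setuptools import setup, Extension

# Compile the fim extension from the bundled C++ sources at install time,
# instead of downloading a precompiled fim.so after installation.
# The source file name and the flags below are illustrative only.
fim_module = Extension(
    name='elephant.spade_src.fim',
    sources=['elephant/spade_src/fim.cpp'],
    language='c++',
    extra_compile_args=['-O3', '-std=gnu++17', '-fopenmp'],
    extra_link_args=['-fopenmp'],
)

setup(
    name='elephant',
    ext_modules=[fim_module],
)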

Also, a change from nosetests to pytest was required in order to have the --import-mode=importlib option available, which makes sure that the compiled version of the library is used for unit testing. This includes most changes of #413.
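
As an illustration of why this matters, pytest can be pointed at the installed package like this (an illustrative invocation, not the exact CI command):

import pytest

# importlib mode does not prepend the source tree to sys.path, so
# "import elephant" resolves to the installed package containing the
# compiled fim extension rather than the working copy.
pytest.main(['--import-mode=importlib', 'elephant/test/test_spade.py'])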

@stellalessandra There is one unit test at line 275 of test_spade.py that you had commented out and which fails. Does this need fixing?

pbouss and others added 21 commits September 15, 2020 12:08
* Added cibuildwheel action

* Added Python requirements to wheel build

* Build only on 64bit machines, otherwise overflow

* Removed Windows for testing, as vc is not available

* Removed MacOS for testing, as -fopenmp is not available

* Removed pp- (pypy) builds since they lack C.

* Fixed removing pp- (pypy) builds since they lack C.

* Put Macos back in.

* Windows Hack

* Remove vcpython altogether, ignore 2.7 Python

* Removed extra compile option, which breaks on Windows

* Removed more extra compile options, which breaks on Windows

* Try C++ instead of Gnu++.

* Try C++ instead of Gnu++ Windows style argument.

* Remove linux build while testing windows.

* Remove libraries.

* Differentiate Windows and Linux.

* Added missing import.

* Last mile: MacOS

* Remove openMP lib

* Remove openMP lib

* Add openMP lib

* More brew installs

* Mac is called mac on github

* Make sure C is reinstalled.

* Multilib

* Next try, new options

* Ignore warning about void type

* Update newest fim package

* Revert "Ignore warning about void type"

This reverts commit 3ff6b62

* Revert to prior fim, new compiler argument.

* Revert "Update newsest fim package"

This reverts commit f321f77

* Definitely, gnu++17, but new try.

* Try C++

* Warning message

* llvm maybe?

* Added apple in source

* Small fixes for MacOS, but not comprehensive

* Limit to Windows and Linux for now

* Remove MacOS entry

* Fix fix from mindlessness

* Testrun

* Trying to include fim.so, despite its renaming by wheels

* Added newest version of original module

* Reverted previous breaking change committed by accident.

* Reverted package name from testing.

* Test focal as CI build

* Test bionic as CI build

* Understand installation issue on CI -- is importing elephant importing the installed version?

* Spelling error only

* Try to make sure travis loads the installed elephant, not the cwd.

* One step further -- which version will nosetests use?

* Switch to pytest as of PR NeuralEnsemble#413
@mdenker added the "enhancement (Editing an existing module, improving something)" label on May 14, 2021
@pep8speaks commented May 14, 2021

Hello @mdenker! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

Line 28:80: E501 line too long (85 > 79 characters)
Line 29:80: E501 line too long (84 > 79 characters)
Line 31:80: E501 line too long (83 > 79 characters)
Line 32:80: E501 line too long (99 > 79 characters)
Line 33:80: E501 line too long (97 > 79 characters)
Line 34:80: E501 line too long (81 > 79 characters)

Line 311:80: E501 line too long (82 > 79 characters)
Line 312:35: E231 missing whitespace after ','

Line 32:13: E251 unexpected spaces around keyword / parameter equals
Line 32:15: E251 unexpected spaces around keyword / parameter equals
Line 33:16: E251 unexpected spaces around keyword / parameter equals
Line 33:18: E251 unexpected spaces around keyword / parameter equals
Line 34:21: E251 unexpected spaces around keyword / parameter equals
Line 34:23: E251 unexpected spaces around keyword / parameter equals
Line 35:17: E251 unexpected spaces around keyword / parameter equals
Line 35:19: E251 unexpected spaces around keyword / parameter equals
Line 36:18: E251 unexpected spaces around keyword / parameter equals
Line 36:20: E251 unexpected spaces around keyword / parameter equals
Line 37:27: E251 unexpected spaces around keyword / parameter equals
Line 37:29: E251 unexpected spaces around keyword / parameter equals
Line 66:80: E501 line too long (89 > 79 characters)

Comment last updated at 2021-08-13 06:01:30 UTC

@stellalessandra (Contributor) commented
@mdenker I commented out the test because a filter for the maximum number of occurrences is not implemented in Florian's fpgrowth. It is a filter to avoid mining patterns with more occurrences than the fixed maximum. I think we can just remove it from SPADE, as it is an unintuitive parameter given the purposes of the method. I don't see why one should decide to mine patterns with fewer occurrences than the ones that can be found...

@mdenker (Member, Author) commented May 14, 2021

@stellalessandra Ah, thanks for the clarification. Then I suggest leaving the unit test commented out for now and discussing the removal of this parameter in a new issue/PR, in order not to blow up this PR.

@coveralls (Collaborator) commented May 14, 2021

Coverage decreased (-0.5%) to 88.412% when pulling 11d963d on INM-6:enh/accelerated_spade into c461884 on NeuralEnsemble:master.

@mdenker self-assigned this on May 17, 2021
@mdenker removed their assignment on May 17, 2021
@mdenker added this to the v0.11.0 milestone on May 21, 2021
@Kleinjohann (Contributor) left a comment


I found a few typos, and I'm not sure about a second test that's also commented out. I was able to verify locally that the build works on Ubuntu using act, but Windows is not supported there; how did you test that, @mdenker?

elephant/spade.py: outdated review comment (resolved)
elephant/spade.py: outdated review comment (resolved)
Comment on lines 704 to 722
# def test_different_surrogate_method(self):
# np.random.seed(0)
# random.seed(0)
# spiketrains = [stg.homogeneous_poisson_process(rate=20*pq.Hz)
# for _ in range(2)]
# surr_methods = ('dither_spikes', 'joint_isi_dithering',
# 'bin_shuffling',
# 'dither_spikes_with_refractory_period')
# pv_specs = {'dither_spikes': [[2, 2, 0.8], [2, 3, 0.2]],
# 'joint_isi_dithering': [[2, 2, 0.8]],
# 'bin_shuffling': [[2, 2, 1.0], [2, 3, 0.2]],
# 'dither_spikes_with_refractory_period':
# [[2, 2, 0.8]]}
# for surr_method in surr_methods:
# pv_spec = spade.pvalue_spectrum(
# spiketrains, bin_size=self.bin_size,
# winlen=self.winlen, dither=15*pq.ms,
# n_surr=5, surr_method=surr_method)
# self.assertEqual(pv_spec, pv_specs[surr_method])

Contributor:
What's up with this test?

Member Author:
Not sure -- it passes for me and also appears to be functional. @stellalessandra Do you know why this one is commented out?

Contributor:
I think we should remove it. It is a test that cross-checks whether several surrogate methods lead to the same p-value spectrum. But it is not general enough, and we have tests in spike_train_surrogates.py that check the correctness of the surrogate techniques...

Member Author:
Ok, I see the point. I'd agree this would be somewhat redundant, and it may break in case of changes to the implementations of the surrogate methods, even if those are correct (i.e., after an update of the algorithm, a surrogate method may produce a different realization for a given random seed -- in that case this test would start to fail, but there would be no bug).

@mdenker self-assigned this on Jun 6, 2021
@mdenker merged commit 8c388e4 into NeuralEnsemble:master on Aug 13, 2021
@Moritz-Alexander-Kern mentioned this pull request on Mar 9, 2022
@Moritz-Alexander-Kern deleted the enh/accelerated_spade branch on October 28, 2022 07:35