Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Naming convention (binsize -> bin_size, etc.) #316

Merged
merged 38 commits into from
Aug 7, 2020

Conversation

dizcza
Copy link
Member

@dizcza dizcza commented Apr 29, 2020

Update Sep 08, 2020

The voting session is closed: any additional votes will be ignored. If you find the current naming convention inconvenient, open an issue or discuss it with us on Neuralensemble mailing list or gitter chat.


IMPORTANT. The old arguments are deprecated but not removed. Even though it looks like they are removed from the code, the old signature will still work - it just throws a warning, which can become a hard error in 2 or 3 releases since then, if we want.

Below is a compiled list, made by elephant doc team with ❤️

Deprecated arguments mapping

Elephant-wise

  • binsize --> bin_size
  • num_bins --> n_bins in all modules except GPFA transform_info dict that has the key 'num_bins'; commit 0408896
  • binned_st[s] --> binned_spiketrain because binned_spiketrains, as spiketrains, would imply a list of BinnedSpikeTrain objects
  • fs -> sampling_frequency. fs is used in 4 elephant functions: butter, wavelet_transform, welch_psd, and welch_cohere.

  • _freq -> _frequency? freq is used 4 times: butter, welch_psd, welch_coherence, and GPFA

  • a pair of similar signals, spiketrains, and other variables will have suffixes _i and _j. Below is a list of affected functions.
elephant/asset.py:    def __init__(self, spiketrains, spiketrains_y=None, bin_size=3 * pq.ms,
elephant/asset.py:                 t_start_x=None, t_start_y=None, t_stop_x=None, t_stop_y=None
elephant/spike_train_correlation.py:    _CrossCorrHist.__init__(self, binned_spiketrain1, binned_spiketrain2, window):
elephant/spike_train_correlation.py:    def get_valid_lags(binned_spiketrain1, binned_spiketrain2):
elephant/spike_train_correlation.py:    def spike_time_tiling_coefficient(spiketrain_1, spiketrain_2, dt=0.005 * pq.s):
elephant/spike_train_correlation.py:    def run_P(spiketrain_1, spiketrain_2):
elephant/spike_train_correlation.py:    def cross_correlation_histogram(binned_spiketrain1, binned_spiketrain2,
elephant/spectral.py:def welch_cohere(x, y, n_segments=8, len_segment=None, freq_resolution=None,



Function-specific

  • cubic
    • data -> histogram
    • ximax -> max_iterations
  • victor_purpura_dist
    • trains --> spiketrains
    • q -> cost_factor; is there a particular reason, a paper, where q is used?
  • cv2 and lv v --> time_intervals
  • cell_assembly_detection
    • data --> binned_spiketrain
    • maxlag --> max_lag
    • min_occ -> min_occurences
    • same_config_cut -> same_configuration_pruning
  • cross_correlation_function
    • ch_pairs --> channel_pairs
    • nlags --> n_lags
    • env -> hilbert_envelope
  • lst_input -> input_spiketrains in BinnedSpikeTrain; closes renamed 'lst_input' -> 'spiketrains' in BinnedSpikeTrain class' #311
  • hilbert N --> padding (N is passed in scipy.signal.hilbert, which has this argument)
  • wavelet_transform
    • nco -> n_cycles
  • welch_psd, welch_cohere
    • num_seg -> n_segments
    • len_seg -> len_segment
    • freq_res -> freq_resolution
  • welch_cohere() -> welch_coherence(). As a remark, scipy, to my surprise, uses scipy.signal.coherence and not scipy.signal.cohere.
  • spike_extraction extr_interval -> interval - the function name already has the key "extraction"
  • peak_detection format -> as_array
  • single_interaction_process
    • rate_c -> coincidence_rate
    • return_coinc -> return_coincidences
    • n -> n_spiketrains
  • JointISI refr_period -> refractory_period
  • surrogates surr_method -> method
  • sskernel --> optimal_kernel_bandwidth
    • w -> bandwidth
    • tin -> times
  • spike_train_surrogates module
    • n --> n_surrogates
  • estimate_csd coords -> coordinates
  • kernels t -> time
  • extract_neo_attrs -> extract_neo_attributes
    • obj -> neo_object
  • generate_lfp
    • ele_xx -> x_positions or positions_x
    • xlims -> x_limits or limits_x? Currently, it's x_limits, but then why do we use max_something and not something_max?
    • res -> resolution;
  • multiple_filter_test
    • dt -> time_step or step
  • empirical_parameters
    • dt -> time_step or step
  • victor_purpura_dist -> victor_purpura_distance
    • trains -> spiketrains
    • q -> cost_factor
  • van_rossum_dist -> van_rossum_distance
    • trains -> spiketrains
    • tau -> time_constant
  • spike_train_timescale
    tau_max -> max_tau
  • compound_poisson_process A -> amplitude_distribution
  • cross_correlation_histogram cross_corr_coef -> cross_correlation_coefficient


  • corrcoef() -> correlation_coefficient()


Not refactored

Because they require special care (ASSET already follows the naming convention):

  • SPADE
  • Unitary Event Analysis
  • GPFA

Also, I didn't pay attention to _protected functions not because they can avoid the naming convention but because I didn't want to spend time refactoring them. Keep in mind that any decent renaming may lead to breaking the logic and thus should be justified.

Won't change

  • In homogeneous_gamma_process() function a is the shape and b is the rate of a distribution will not be renamed to avoid a misinterpretation of the rate of the outcome spiketrain, which is equal to b/a and not b.
  • In surrogates() function dt won't be changed because it means different things depending on the surrogate method choice.
  • In multiple_filter_test() and empirical_parameters() functions t_final won't be changed to t_stop because it serves a different purpose.
  • nfft became a standard over the years, even though it does not comply with our convention, which renders this variable as n_fft.

@pep8speaks
Copy link

pep8speaks commented Apr 29, 2020

Hello @dizcza! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

Line 14:5: E741 ambiguous variable name 'I'

Line 1073:80: E501 line too long (81 > 79 characters)

Comment last updated at 2020-08-07 11:56:20 UTC

@coveralls
Copy link
Collaborator

coveralls commented Apr 29, 2020

Coverage Status

Coverage decreased (-0.2%) to 88.983% when pulling 4d19722 on INM-6:naming-convention/bin_size into 9007517 on NeuralEnsemble:master.

@dizcza dizcza force-pushed the naming-convention/bin_size branch from 022c593 to 9bd7c29 Compare June 23, 2020 15:32
@dizcza dizcza changed the title Renamed binsize to bin_size to follow the doc guide line Naming convention (binsize -> bin_size, etc.) Jun 29, 2020
elephant/current_source_density.py Outdated Show resolved Hide resolved
elephant/spike_train_generation.py Outdated Show resolved Hide resolved
@kohlerca
Copy link
Collaborator

Regarding cross_coef/cross_corr_coef in spike_train_correlation:

The parameter in cross_correlation_histogram function is passed to the protected class _CrossCorrHist. Therefore, according to the conventions of just using corr as an abbreviation for correlation, cross_corr_coefficient will make the information clearer. The fact that it is passed internally, I don't see as an annoyance of a larger parameter name.

Moreover, in _CrossCorrHist, the class method should be cross_correlation_coefficient to keep the standard as in spike_time_tiling_coefficient, and to align to what we discussed regarding keeping the Elephant function names without abbreviations. The latter also applies to function corrcoef. Although NumPy has this function, in Elephant we are talking about an implementation that works with elephant.BinnedSpikeTrain inputs. I don't see any benefit in shadowing NumPy.

Finally, in the class _CrossCorrHist, the current cross_corr_coef function takes an argument named only cross_corr. It should also define which is the input, as cross_corr* is a prefix for histogram or matrices. Therefore, something in the direction of cross_corr_array or cross_corr_matrix will make the code clear.

@kohlerca
Copy link
Collaborator

Regarding _x and _y or _1 and _2 in welch_cohere and ASSET:

I don't think they can be analyzed together. The idea of _1 and _2 is for the functions where the parameter order does not matter, which is the case of welch_cohere.

In ASSET, the spiketrains_y will actually reflect in the interpretation of the matrices axes. Therefore, _x and _y is meaningful with respect to the method output. ASSET is tough because you don't need both parameters, but I think it is better to name the first (non-optional) as _x and the optional as _y.

@mdenker
Copy link
Member

mdenker commented Jul 14, 2020

@kohlerca Regarding _x and _y or _1 and _2 in welch_cohere and ASSET.

Also in welch_cohere, the order of parameters makes a big differene when interpreting the phase_lag return value, since the order defines the reference and the referenced, so to say. In this way, I think the situation is very similar to ASSET. Now, I think the main reason we dislike 1 and 2 is because it looks terrible in the variable name, and I think we dislike x and y because it is commonly associated to a position. How about a compromise and using i and j? This is commonly used to identify this type of relationship...

@kohlerca
Copy link
Collaborator

@mdenker How about a compromise and using i and j? This is commonly used to identify this type of relationship...

I guess this makes total sense and it is better indeed than either of the previous options.

@dizcza
Copy link
Member Author

dizcza commented Jul 15, 2020

@mdenker How about a compromise and using i and j? This is commonly used to identify this type of relationship...

I guess this makes total sense and it is better indeed than either of the previous options.

Added _i and _j option in the poll.

@dizcza dizcza merged commit 9e67193 into NeuralEnsemble:master Aug 7, 2020
@dizcza dizcza deleted the naming-convention/bin_size branch August 7, 2020 12:12
BranchBroxy pushed a commit to BranchBroxy/elephant that referenced this pull request Sep 15, 2020
* faster mean_firing_rate for a typical use case (NeuralEnsemble#331)

* explicit doc of how the firing rate is computed

* Timescale function option to return nan if there are few spikes (<=2) (NeuralEnsemble#328)

* include option to return np.nan if spiketrains are too short and change raised error to more meaningful ones

* Build documentation in Travis (NeuralEnsemble#335)

* fixed _check_consistency in BinnedSpikeTrain and parallel.ipynb notebook

* added docs build in travis; fixed plt.eventplot(spiketrains)

* don't use conda-forge channel for python 2.7

* install extra requirements in docs build

* conda install pandoc

* travis hangs

* travis doc mpi error

* travis check if error is occurred

* Revert "travis check if error is occurred"

This reverts commit 3e59894.

* reverted travis

* Removed deprecation warnings from 0.7.0 (NeuralEnsemble#306)

* removed deprecation warning from unitary event analysis

* removed homogeneous_poisson_process_with_refr_period

* don't import pandas_bridge

* don't use nan

* don't pad with zeros

* bin_shrink_factor parameter

* precompute edges for _binning_half_overlap

* deal with spiketrains of length 1

* return trace optionally

* ASSET optimized probability_matrix_analytical and looping in _jsf_uniform_orderstat_3d (NeuralEnsemble#333)

* removed annoying short-lasting tranges
* vectorized probability_matrix_analytical
* ASSET iterate indices optimized
* replaced np.diff(prepend, append) by manually prepending and appending

Co-authored-by: Cristiano Köhler <[email protected]>

* Naming convention (binsize -> bin_size, etc.) (NeuralEnsemble#316)

* Release v0.8.0 (NeuralEnsemble#340)

* all quantities

* python2 issues

* python2 issues again

* Update acknowledgments.rst

Added funding acknowledgements from HBP SGA3.

* added __all__ in elephant modules (NeuralEnsemble#342)

* Added a warning in fanofactor function when the input spiketrains vary in their durations (NeuralEnsemble#341)

* fixed wrong default min_bin units

* naming

* download & unzip API

* Feature/inhomogeneous gamma (NeuralEnsemble#339)

* Added information on citing Elephant to documentation, fixed bib entries (NeuralEnsemble#345)

* Added CITATION.txt file to manifest, so it's included in packages.

* Fixed doi entry to not include the doi resolver.

* Three surrogate methods: Trial-shifting, Bin Shuffling, ISI dithering (NeuralEnsemble#343)

Co-authored-by: stellalessandra <[email protected]>
Co-authored-by: p-bouss <[email protected]>
Co-authored-by: Cristiano Köhler <[email protected]>

* SPADE: New way to count patterns for multiple testing (NeuralEnsemble#347)

Co-authored-by: stellalessandra <[email protected]>
Co-authored-by: p-bouss <[email protected]>

* spike synchrony doc; take the first 5 networks to run the test

* renamed test module

* group spike train correlation, dissimilarity, and synchrony

* tutorials: changed wget to curl for platform compatibility (NeuralEnsemble#350)

* Time-domain pairwise Granger causality  (NeuralEnsemble#332)

Co-authored-by: ackurth <[email protected]>
Co-authored-by: ackurth <[email protected]>
Co-authored-by: dizcza <[email protected]>
Co-authored-by: Michael Denker <[email protected]>

Co-authored-by: Aitor MG <[email protected]>
Co-authored-by: Cristiano Köhler <[email protected]>
Co-authored-by: Michael Denker <[email protected]>
Co-authored-by: pbouss <[email protected]>
Co-authored-by: stellalessandra <[email protected]>
Co-authored-by: p-bouss <[email protected]>
Co-authored-by: Cristiano Köhler <[email protected]>
Co-authored-by: Regimantas Jurkus <[email protected]>
Co-authored-by: ackurth <[email protected]>
Co-authored-by: ackurth <[email protected]>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

6 participants