Update tcm methods to new VK version #25

camilogarciabotero · 2024-03-18T23:50:42Z

This PR updates the transition_count_matrix to be compatible with new version of VectorizedKmers.jl (v0.9).

…rkovChain

camilogarciabotero

For the sake of recording some unusual benches:

using BioSequences, BioMarkovChains

seq = randdnaseq(10^6)

@btime transition_count_matrix(seq); #main
1.101 ms (2 allocations: 384 bytes)

@btime transition_count_matrix(seq); #vkpatch
1.272 ms (31 allocations: 1.58 KiB)

I honestly don't know why is that happening since the underlying function count_kmers is improved in the new VectorizedKmers version. I do wonder if @anton083 do you know what might be going on here?

AntonOresten · 2024-03-24T00:47:28Z

Hi Camilo!

Good catch. I found the issue to be the extra overhead of calling:

kmer_array[kmer] += 1

instead of:

kmer_array.values[kmer + 1] += 1

in my count_kmers! methods.

The former method is defined like so:

@inline Base.setindex!(ka::KmerArray, v, kmer::Integer) = (ka.values[kmer + 1] = v)

with @inline and everything, so I didn't think there'd be much of a difference.

I'll be switching to the latter in the count_kmers! methods in a new patch.

Thank you for spotting this!😊
Anton

AntonOresten · 2024-03-24T02:31:34Z

The changes have now been pushed to patch v0.9.1.

using VectorizedKmers, BioSequences, BenchmarkTools

@btime count_kmers($(randdnaseq(10^6)), 2) # v0.8.1
  990.500 μs (28 allocations: 1.38 KiB)

@btime count_kmers($(randdnaseq(10^6)), 2) # v0.9.0
  1.054 ms (29 allocations: 1.38 KiB)

@btime count_kmers($(randdnaseq(10^6)), 2); # v0.9.1
  963.600 μs (29 allocations: 1.38 KiB)

v0.9.1 is only performing better than v0.8.1 because I'm now indexing a pre-defined kmer_array_values = kmer_array.values instead of indexing kmer_array.values every time. The tiniest changes can have significant impact at this level.😆

camilogarciabotero · 2024-03-24T04:13:11Z

Thank you Anton,

The runtime performance looks nice now. I was also confused/concerned by the number and memory allocations I noticed in the benchmark. I don't know if you can reproduce it with BioMarkovChains.

AntonOresten · 2024-03-24T14:41:18Z

I'm afraid the memory allocations come from neither the KmerArray constructor, nor count_kmers! function itself, but instead the dispatching of count_kmers! within the count_kmers function. I'm not sure, but this might be due to some type-instability.

@btime begin
    kmer_array = KmerArray(4, 2)
    count_kmers!(kmer_array, seq)
end;
  958.300 μs (1 allocation: 192 bytes)

function _count_kmers(seq, K::Integer)
    kmer_array = KmerArray(4, K)
    count_kmers!(kmer_array, seq)
end;

@btime _count_kmers(seq, 2);
  961.500 μs (29 allocations: 1.38 KiB)

As soon as we put it in a function, it does some extra allocations for some reason.

And we can narrow this down to the count_kmers! call because without it we don't see the allocations:

function _count_kmers_no_counting(seq, K::Integer)
    kmer_array = KmerArray(4, K)
    #count_kmers!(kmer_array, seq)
end;

@btime _count_kmers_no_counting(4, 2);
  35.944 ns (1 allocation: 192 bytes)

But as you can see in the first snippet, the count_kmers! call itself doesn't seem to make any allocations.

Maybe relevant: https://discourse.julialang.org/t/unexpected-allocations-with-multiple-dispatch/76988

EDIT: Now in hindsight it's clear that the types in the _count_kmers function weren't stable cause K was a function argument, whereas N and K were fixed in the first block.

AntonOresten · 2024-03-24T16:24:12Z

I have discovered the problem. When alphabet size N and K-mer length K are passed as regular arguments to count_kmers their values are unknown at compile time. To fix this, the base method for count_kmers should take integers wrapped in Vals, such that it can compile for each N and K pair individually.

count_kmers1(seq, N::Int, K::Int) = count_kmers!(KmerArray(N, K), seq);

@btime count_kmers1(seq, 4, 2);
  964.000 μs (29 allocations: 1.38 KiB)

count_kmers2(seq, ::Val{N}, ::Val{K}) where {N, K} = count_kmers!(KmerArray(N, K), seq);

@btime count_kmers2(seq, Val(4), Val(2));
  958.500 μs (2 allocations: 208 bytes)

For the API to remain the same, all we need is another method that wraps integers N and K in Vals.

count_kmers2(seq, N::Int, K::Int) = count_kmers2(seq, Val(N), Val(K))

I will still make sure that there is a method that uses the default alphabet size, such that users don't need to specify the value of N.

I think I've seen the bio people use this in Kmers.jl, but I never really understood the benefit. Now I do!

camilogarciabotero · 2024-03-25T23:52:36Z

Wow! That was a tight demonstration of problem-solving! Thanks for sharing the rationale to solve it Anton. I will now update to latest VK version!

camilogarciabotero added 2 commits March 18, 2024 18:40

Update methods for VectorizedKmers update

986bb85

Update tpm to be more similar to previous implementations

95cca59

camilogarciabotero self-assigned this Mar 19, 2024

camilogarciabotero added 2 commits March 18, 2024 19:37

Update show function in extended.jl to include the order of the BioMa…

7ebb2ef

…rkovChain

Remove commented code and unnecessary print statements

79feb70

camilogarciabotero commented Mar 23, 2024

View reviewed changes

This comment was marked as duplicate.

Sign in to view

AntonOresten referenced this pull request in AntonOresten/VectorizedKmers.jl Mar 24, 2024

Optimize count_kmers! methods

a00277a

AntonOresten referenced this pull request in AntonOresten/VectorizedKmers.jl Mar 24, 2024

Bump version

a9c64ed

camilogarciabotero merged commit c2234c3 into main Mar 26, 2024
4 checks passed

camilogarciabotero deleted the vkpatch branch March 26, 2024 00:05

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update tcm methods to new VK version #25

Update tcm methods to new VK version #25

camilogarciabotero commented Mar 18, 2024 •

edited

Loading

camilogarciabotero left a comment •

edited

Loading

This comment was marked as duplicate.

AntonOresten commented Mar 24, 2024 •

edited

Loading

AntonOresten commented Mar 24, 2024 •

edited

Loading

camilogarciabotero commented Mar 24, 2024

AntonOresten commented Mar 24, 2024 •

edited

Loading

AntonOresten commented Mar 24, 2024 •

edited

Loading

camilogarciabotero commented Mar 25, 2024

Update tcm methods to new VK version #25

Update tcm methods to new VK version #25

Conversation

camilogarciabotero commented Mar 18, 2024 • edited Loading

camilogarciabotero left a comment • edited Loading

Choose a reason for hiding this comment

This comment was marked as duplicate.

AntonOresten commented Mar 24, 2024 • edited Loading

AntonOresten commented Mar 24, 2024 • edited Loading

camilogarciabotero commented Mar 24, 2024

AntonOresten commented Mar 24, 2024 • edited Loading

AntonOresten commented Mar 24, 2024 • edited Loading

camilogarciabotero commented Mar 25, 2024

camilogarciabotero commented Mar 18, 2024 •

edited

Loading

camilogarciabotero left a comment •

edited

Loading

AntonOresten commented Mar 24, 2024 •

edited

Loading

AntonOresten commented Mar 24, 2024 •

edited

Loading

AntonOresten commented Mar 24, 2024 •

edited

Loading

AntonOresten commented Mar 24, 2024 •

edited

Loading