Updated Train-kernel-parameters example #430
Conversation
Merge of …iaGaussianProcesses/KernelFunctions.jl into st/examples--train-kernel-parameters, bringing in the following commits from master:

- f9bbd84 (st--, 2022-01-28): make nystrom work with AbstractVector (#427)
- d1c68a9 (st--, 2022-01-13): fix Distances compat (#423)
- 93d33c2 (st--, 2022-01-12): fix figure & cleanup (#422)
- 40cb59e (github-actions[bot], 2022-01-12): CompatHelper: bump compat for Kronecker to 0.5 for package docs, (keep existing compat) (#367)
- 7204529 (github-actions[bot], 2022-01-11): CompatHelper: bump compat for Kronecker to 0.5 for package test, (keep existing compat) (#366)
- 924925d (st--, 2022-01-11): switch SVM example to half-moon dataset (#421)
- 992b665 (github-actions[bot], 2021-12-24): CompatHelper: bump compat for SpecialFunctions to 2 for package test, (keep existing compat) (#412)
- 04fa7f7 (github-actions[bot], 2021-12-23): CompatHelper: bump compat for SpecialFunctions to 2, (keep existing compat) (#411)
- c0fc3e1 (github-actions[bot], 2021-12-23): CompatHelper: add new compat entry for Compat at version 3 for package test, (keep existing compat) (#418)
- 05fe340 (st--, 2021-12-21): use only() instead of first() (#403)
- 2d17212 (st--, 2021-12-18): Zygote AD failure workarounds & test cleanup (#414)
- 3c49949 (Théo Galy-Fajou, 2021-11-24): Fix typo in valid_inputs error (#408)
- 9955044 (st--, 2021-11-24): Fix for Zygote 0.6.30 breaking our tests (#409)
- 33d64d1 (Théo Galy-Fajou, 2021-11-04): Add benchmarking CI (#399)
- 360ce10 (David Widmann, 2021-11-02): Update docstring of `GibbsKernel` (#395)
Codecov Report
```
@@           Coverage Diff           @@
##           master     #430   +/-  ##
=======================================
  Coverage   93.13%   93.13%
=======================================
  Files          52       52
  Lines        1252     1252
=======================================
  Hits         1166     1166
  Misses         86       86
```

Continue to review full report at Codecov.
I didn't build the documentation locally; is there a way to preview from here?
There's a preview for PRs that are not from forks; the link to the deployed docs will show up among the GitHub Actions (see e.g. #429). However, there's currently no link since the documentation is not building successfully.
The problem is that BenchmarkTools is not installed: https://github.com/JuliaGaussianProcesses/KernelFunctions.jl/runs/4983109277?check_suite_focus=true#step:5:31

Is it actually necessary to perform benchmarks?
Ah, that makes a lot of sense. Not strictly necessary, but there are interesting things to see.
```julia
function loss(θ)
    ŷ = f(x_train, x_train, y_train, θ)
    return sum(abs2, y_train - ŷ) + exp(θ[4]) * norm(ŷ)
```
Two questions:

a) why not

```suggestion
return norm(y_train - ŷ) + exp(θ[4]) * norm(ŷ)
```

b) can you explain where the second term comes from? Just thinking about "parameter regularisation" might make one think it ought to be `norm(θ)`.
This part is from your original example; I am honestly not sure.
Which in turn was part of a previously existing example 😅 @theogf @willtebbutt, how would you address this (comment to add / equation to fix / ...)?
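(For reference, a minimal sketch of the two loss variants being discussed, assuming the `f`, `x_train`, `y_train`, and `θ` defined earlier in the example; the parameter-regularised variant only illustrates the suggestion above and is not code from the PR.)

```julia
using LinearAlgebra: norm

# Loss as it currently appears in the example: squared data-fit term
# plus a penalty on the norm of the predictions ŷ.
function loss_prediction_penalty(θ)
    ŷ = f(x_train, x_train, y_train, θ)
    return sum(abs2, y_train - ŷ) + exp(θ[4]) * norm(ŷ)
end

# Hypothetical parameter-regularised alternative: penalise the kernel
# parameters themselves (θ[4] is the regularisation weight, so it is excluded).
function loss_parameter_penalty(θ)
    ŷ = f(x_train, x_train, y_train, θ)
    return sum(abs2, y_train - ŷ) + exp(θ[4]) * norm(θ[1:3])
end
```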
```julia
# ### Training
# Setting an initial value and initializing the optimizer:
θ = log.([1.1, 0.1, 0.01, 0.001]) # Initial vector
```
why not

```suggestion
θ = log.(ones(4)) # Initial vector
```

as you had before? (Alternatively, use the values from here above as well?)
NB: what do you think of commenting explicitly on this being a mutable vector that will be changed in-place by the optimiser?
I think this also comes from the original example; I can experiment with setting it to `ones`.
It's probably a reasonable setting to end up in the right local optimum - maybe it's simpler to just change the very first `ones(4)` to also be `[1.1, 0.1, 0.01, 0.001]` instead? :)
```julia
# Computational cost for one step

@benchmark let θt = θ[:], optt = Optimise.ADAGrad(0.5)
```
Why do you introduce a new `optt`? (I'm assuming it's due to the internal optimizer state that you don't want to affect for the actual optimisation loop, but this might not be clear to the casual reader!)

As it's within a `let` block, you could just call it `opt` and it wouldn't get outside the scope of the `let` block.

It might then also be cleaner if you move the `opt = ...` definition from line 80 to the section below (line 94 following) where it's actually used.
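(As a side note, not from the PR: a tiny sketch of the scoping point - a binding introduced inside `let` does not leak into the enclosing scope, so reusing the name `opt` there is safe. The string values are purely illustrative.)

```julia
opt = "outer optimiser"        # hypothetical outer binding
let
    opt = "inner optimiser"    # creates a new local, shadowing the outer binding
    println(opt)               # prints "inner optimiser"
end
println(opt)                   # still prints "outer optimiser"
```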
I tried a few things, but there seemed to be some leaking. This was my way of fixing that leakage.
I can take another look.
You definitely need to copy the `θ` so you don't change the initial values. But

```julia
@benchmark let
    θ = log.(ones(4))
    opt = Optimise.ADAGrad(0.5)
    ...
end
```

should work.
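(A slightly fuller, hypothetical version of that sketch - the gradient/update step is assumed to follow the training loop in the example and is not taken verbatim from the PR; `θ` and `loss` are the ones defined in the example.)

```julia
using BenchmarkTools
using Flux: Optimise
using Zygote

# Benchmark a single optimisation step without touching the θ used by the
# actual training loop: everything here is local to the let block.
@benchmark let
    θt = copy(θ)                             # copy so the initial values stay intact
    opt = Optimise.ADAGrad(0.5)
    grad = only(Zygote.gradient(loss, θt))   # gradient of the example's loss w.r.t. θt
    Optimise.update!(opt, θt, grad)          # one in-place ADAGrad step
end
```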
```julia
gif(anim, "train-kernel-param.gif"; show_msg=false, fps=15);
nothing; #hide

# ![](train-kernel-param.gif)
```
What's the benefit of this vs. having the output from `gif()` inline?
Locally it didn't display correctly.
Huh! Alright. Well, as long as it works in the deployed docs it doesn't matter too much one way or another :)
```julia
# ## Using ParameterHandling.jl
# Alternatively, we can use the [ParameterHandling.jl](https://github.com/invenia/ParameterHandling.jl) package
# to handle the requirement that all kernel parameters should be positive.
```
It might be nice to also comment on it allowing arbitrarily nested NamedTuples, so parameters can be accessed easily without worrying about where they sit in the flat parameter vector, their constraints, etc.
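(To illustrate that remark - a minimal, hypothetical sketch of nested NamedTuples with ParameterHandling.jl; the parameter names below are made up and not taken from the example.)

```julia
using ParameterHandling

# Constrained parameters in an arbitrarily nested NamedTuple.
raw_initial_θ = (
    kernel = (variance = positive(1.0), lengthscale = positive(0.5)),
    noise_var = positive(0.1),
)

# value_flatten returns a flat, unconstrained vector (for the optimiser) and a
# function that rebuilds the nested, constrained values from such a vector.
flat_θ, unflatten = ParameterHandling.value_flatten(raw_initial_θ)

θ = unflatten(flat_θ)
θ.kernel.lengthscale   # access by name, no need to know its position in flat_θ
```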
```julia
)

flat_θ, unflatten = ParameterHandling.value_flatten(raw_initial_θ)
nothing #hide
```
Do you actually need these? I think Literate.jl only splits up code cells at non-code markdown (comments), so all of this would run as a single cell and its return values wouldn't get printed anyways?
You are probably right, and this is some artifact from me moving things around.
```julia
# ## Flux.destructure
# If we don't want to write an explicit function to construct the kernel, we can alternatively use the `Flux.destructure` function.
# Again, we need to ensure that the parameters are positive. Note that the `exp` function is now part of the loss function, instead of part of the kernel construction.
# We could also use ParameterHandling.jl here, similar to the example above.
```
Not clear to me - how would you combine the two? I had thought it was either-or.
My impression was that one could do a `kernelc ∘ flatten` (or unflatten) construction.
if you think it's something one would actually want to do in practice, it'd be great to expand on it a bit more, but otherwise I would prefer removing the reference so as to not confuse people (like myself reading it now 😅 )
```julia
p, kernelc = Flux.destructure(kernel);

# This returns the `trainable` parameters of the kernel and a function to reconstruct the kernel.
```
```suggestion
# This returns the "trainable" parameters of the kernel and a function to reconstruct the kernel.
```

Or did you mean for `trainable` to be typeset as code?
As typeset in code, for `Flux.trainable` (which I understand from another PR might be going away, but still).
`Flux.trainable` isn't exported (and barely documented - the only reference I could find was in https://github.com/FluxML/Flux.jl/blob/4a3483efd8e13437b3d86371723c977fb61c2793/docs/src/models/advanced.md, but that doesn't seem relevant here), so as it is, it seems to me rather confusing. Maybe

```suggestion
# This returns the trainable `params` of the kernel and a function to reconstruct the kernel.
```

? Or just

```suggestion
# This returns the trainable parameters of the kernel and a function to reconstruct the kernel from these parameters.
```
```julia
kernel = (θ[1] * SqExponentialKernel() + θ[2] * Matern32Kernel()) ∘ ScaleTransform(θ[3])

p, kernelc = Flux.destructure(kernel);
```
What's the `p`? What is it needed for? What does it do?
Probably not needed anymore. Initially I wanted to show that `kernelc(p) == kernel`, but due to a mutability issue this does not evaluate as `true`, despite them being `==` in almost all respects.
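(For reference, a minimal sketch of the `Flux.destructure` round trip being discussed; the kernel line mirrors the one in the diff above, and the concrete parameter values are only illustrative.)

```julia
using Flux, KernelFunctions

θ = [1.1, 0.1, 0.01]   # illustrative positive parameter values
kernel = (θ[1] * SqExponentialKernel() + θ[2] * Matern32Kernel()) ∘ ScaleTransform(θ[3])

# p: flat vector of the kernel's trainable parameters,
# kernelc: function that rebuilds a kernel from such a vector.
p, kernelc = Flux.destructure(kernel)

k2 = kernelc(p)                    # reconstructed kernel
k2(0.3, 0.7) ≈ kernel(0.3, 0.7)    # evaluates identically, even if `==` does not hold
```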
Thanks, I think this is a useful addition to make clear that you can train kernel parameters (and how to do so). I've left minor comments as GitHub suggestions; there are a few other comments that it would be good to address before it's ready to merge. Let me know if anything doesn't make sense!
@Crown421 let me know once you've addressed my remaining comments (either through changes or by replying with what doesn't make sense to you / what you disagree with :)), and I'll finish off the review!
@st-- I think I addressed all comments now. Apologies for the delay; I also managed to catch COVID since we spoke.
This looks like a very helpful addition, thanks for contributing! There's just one remaining comment on the loss :) let me know what you want to do with that, then we can merge it.
Summary
Picking up #317, and adding more to it. The commits might be a bit of a mess, due to some unfortunate experiments with the GitHub Desktop app.
Proposed changes
What alternatives have you considered?