Continuous attribute embedding #102
Conversation
…uild continuous attribute embedders
…te embedders and to be more robust
… preventing overfitting. Reflected in the TensorBoard histograms, where learning was stagnant/non-existent in the attribute embedders before introducing dropout
… corresponding reporting metrics. This has made convergence to good performance 4-8 times faster, from 1000-2000 iterations down to 250 iterations
…d performance. Adding more examples (200) and using a useless attribute that always has value zero, the model converges to a good and expected accuracy. Remove dropout for continuous attributes. Add histograms and a better output location for plots
… extra information over the existing type information) works but leads to overfitting
… best combination for stability and minimising overfitting
…ne with the attribute embeddings
…lity and speed of convergence of the model
Now that gradient clipping and type embedding normalisation are in place, using dropout as intended in the continuous attribute MLP works very well to counter overfitting whilst keeping good stability. At least for the first 2000 iterations; it possibly begins to overfit after that
… not specified and therefore summaries should not be written
step_op = optimizer.minimize(loss_op_tr)
gradients, variables = zip(*optimizer.compute_gradients(loss_op_tr))

for grad, var in zip(gradients, variables):
this for debugging?
Yes, and also functional: `optimizer.minimize(loss)` computes the gradients and applies them. To intercept the gradients and do anything with them, you have to first compute them and then apply them manually.
Here we do this to:
- Visualise the gradients in TensorBoard
- Apply gradient clipping as mentioned in the description
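In outline, that looks like the following. This is a sketch only, assuming the TF1-style `optimizer` and `loss_op_tr` from the diff above; the clipping threshold of 5.0 is illustrative, not necessarily the value used in this PR:

```python
import tensorflow as tf

# Compute gradients explicitly instead of optimizer.minimize(...)
gradients, variables = zip(*optimizer.compute_gradients(loss_op_tr))

for grad, var in zip(gradients, variables):
    if grad is not None:
        # Visualise each gradient's distribution in TensorBoard
        # (':' is not a legal summary-name character, hence the replace)
        tf.summary.histogram(var.name.replace(':', '_') + '/gradient', grad)

# Clip by global norm to stabilise training; None entries are ignored
clipped_gradients, _ = tf.clip_by_global_norm(gradients, 5.0)

# Apply the clipped gradients manually
step_op = optimizer.apply_gradients(zip(clipped_gradients, variables))
```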
@@ -136,3 +124,57 @@ def make_blank_embedder():

    _, _, _, _, _, solveds_tr, solveds_ge = tr_info
    return ge_graphs, solveds_tr, solveds_ge


def configure_embedders(node_types, attr_embedding_dim, categorical_attributes, continuous_attributes):
this is a bit awkward to read... maybe it can be split into separate file/class?
Agreed, it's generally awkward and needs more architectural work, including how to expose this configurability to the user. I intend to address this in a separate PR as it's a big task
Added #103
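For readers following this thread, here is a hedged sketch of the kind of dispatch `configure_embedders` might perform, inferred from its signature alone. `CategoricalAttribute` and `BlankAttribute` are assumed names for the other embedder models (only `ContinuousAttribute` is confirmed by this PR), and the import path is an assumption; the actual implementation, and the refactor tracked in #103, may differ:

```python
# Assumed import path for the attribute embedder models
from kglib.kgcn.models.attribute import (
    BlankAttribute, CategoricalAttribute, ContinuousAttribute)


def configure_embedders(node_types, attr_embedding_dim,
                        categorical_attributes, continuous_attributes):
    embedders = {}
    for node_type in node_types:
        if node_type in categorical_attributes:
            # Categorical values are embedded via a lookup table over
            # the known category labels
            embedders[node_type] = CategoricalAttribute(
                len(categorical_attributes[node_type]), attr_embedding_dim)
        elif node_type in continuous_attributes:
            # Continuous values pass through the new MLP + layer-norm model
            embedders[node_type] = ContinuousAttribute(attr_embedding_dim)
        else:
            # Attribute-less types get a blank (zero) embedding
            embedders[node_type] = BlankAttribute(attr_embedding_dim)
    return embedders
```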
kglib/kgcn/pipeline/pipeline_test.py
Outdated
from kglib.kgcn.pipeline.pipeline import configure_embedders


class TestConstructEmbedders(unittest.TestCase):
already being tested separately!
You mean `configure_embedders` is? Whereabouts?
The name of the test is incorrect though; it should be `TestConfigureEmbedders`
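A minimal sketch of what the renamed `TestConfigureEmbedders` case might assert. The return contract (one embedder per node type) and the argument values are assumptions for illustration, not taken from the diff:

```python
import unittest

from kglib.kgcn.pipeline.pipeline import configure_embedders


class TestConfigureEmbedders(unittest.TestCase):

    def test_embedder_produced_for_each_node_type(self):
        node_types = ['person', 'symptom', 'severity']
        embedders = configure_embedders(
            node_types,
            attr_embedding_dim=6,
            categorical_attributes={'symptom': ['fever', 'cough']},
            continuous_attributes={'severity': (0.0, 1.0)})
        # One embedder expected per node type (assumed return contract)
        self.assertEqual(len(embedders), len(node_types))


if __name__ == '__main__':
    unittest.main()
```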
What is the goal of this PR?
Enable ingesting numerical attributes with continuous values.
The aim has been to add a continuous numerical attribute to the diagnosis example which adds no additional information. In this case, the model should be able to stably achieve the same performance as without this attribute. Empirically, this has been achieved, perhaps taking longer to converge (a minimum of about 500 training iterations, compared to a minimum of 250 iterations prior).
Closes #99
What are the changes implemented in this PR?
- Add a continuous attribute (severity) to the diagnosis example
- Add a ContinuousAttribute model, consisting of an MLP followed by layer normalisation
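Based on that description, a minimal sketch of such a model in the Sonnet 1 / TF1 style used by kglib at the time. The class name matches the PR; the layer sizes, module name, and input shape are assumptions:

```python
import sonnet as snt
import tensorflow as tf


class ContinuousAttribute(snt.AbstractModule):
    """Embeds a raw continuous value with an MLP, then layer-normalises."""

    def __init__(self, attr_embedding_dim, name='continuous_attribute'):
        super(ContinuousAttribute, self).__init__(name=name)
        self._attr_embedding_dim = attr_embedding_dim

    def _build(self, attribute_value):
        # attribute_value: [batch, 1] tensor of raw continuous values
        embedding = snt.nets.MLP(
            [self._attr_embedding_dim] * 3,
            activate_final=True)(tf.cast(attribute_value, tf.float32))
        # Layer normalisation keeps the embedding scale comparable to the
        # type embeddings, which the conversation above notes is important
        # for stability
        return snt.LayerNorm()(embedding)
```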