Lda opt #99

hyu596 · 2018-10-15T02:04:35Z

This PR implements distributed Latent Dirichlet Allocation using the Cirrus system and interface.

Update master branch in the forked repo

Sync with master

jcarreira

Please take a look.

jcarreira · 2019-02-24T05:03:41Z

python/frontend/cirrus/cirrus/lda.py

+
+        return config
+
+    def launch_ps(self, command_dict=None):


This file needs some work to improve a few things:

The lambdas should be kept alive by the automate.py (as is done in the other algorithms).

The parameter server should be launched by the automate.py (see logistic_regression,ipynb)

jcarreira · 2019-02-24T05:05:33Z

src/Configuration.h

    };

+    static constexpr int WORKERS_BASE[5] = {-1, 3, -1, -1, 4};


I don't think this is used.

jcarreira · 2019-02-24T05:45:12Z

src/LDAModel.cpp

+  K_ = load_value<int16_t>(info);
+
+  // load the current topic assignments
+  t.clear();


Because this is the constructor, t/d/w are empty vectors. No need for clear.

jcarreira · 2019-02-24T05:51:46Z

src/Tasks.h

+      std::vector<std::vector<int>>& topic_scope);
+
+ private:
+  std::array<int, VOCAB_DIM_UPPER> lookup_map;


This is a bit dangerous. If VOCAB_DIM_UPPER is too big it can lead to crashes be cause std::array puts data on the stack.

You can replace this with an std::vector and then resize to VOCAB_DIM_UPPER.

This happens in other places. We can fix this later.

lookup_map is now vector and its initializing part (filling with -1) is in the beginning of LoadingLDATaskS3.run function.

jcarreira · 2019-02-24T05:59:20Z

src/LoadingLDATaskS3.cpp

+  // Storing local variables (LDAStatistics)
+  for (unsigned int i = 1; i < num_s3_objs + 1; ++i) {
+    // for (unsigned int i = 1; i < 3; ++i) {
+    std::vector<int> w;


Move this to right before count_dataset().

jcarreira · 2019-02-24T22:41:57Z

src/LDATaskS3.cpp

+        float est_time_one_iter =
+            ((get_time_ms() - start_time) / 1000.) / full_iteration;
+
+        if (elapsed_sec > (lambda_time_out - est_time_one_iter - 10.)) {


Too much code here. Would put this code below in a separate function.

Code refactoring has been done. Please let me know if it remains messy.

jcarreira · 2019-02-24T22:43:01Z

src/LDATaskS3.cpp

+      }
+    }
+    int since_start_sec = (get_time_ms() - start_time) / 1000;
+    if (since_start_sec > benchmark_time) {


Same thing, this should go into separate function

^^Code refactoring has been done. Please let me know if it remains messy.

jcarreira · 2019-02-24T22:47:19Z

src/Tasks.h

+   */
+  void load_serialized_indices(char* mem_begin);
+
+  std::vector<std::unique_ptr<std::thread>> help_upload_threads;


Doesn't seem used

It's being used now [1]. Previously help_upload_threads was removed since in all the experiments I ensured that each worker was assigned with exactly one LDAStatistics and thus it was not needed.

[1] https://github.com/hyu596/cirrus-1/blob/lda_opt/src/LDATaskS3.cpp#L201

Work has been done to ensure it works fine.

jcarreira · 2019-02-24T22:48:17Z

src/LDATaskS3.cpp

+                                        uint64_t to_send_size,
+                                        int& upload_lock,
+                                        int bucket_id) {
+  while (true) {


Don't understand this code. As far as I can understand this task is single threaded (help_upload_threads is not used).

jcarreira · 2019-02-24T22:55:51Z

bootstrap.sh

+fi
+cd lz4-dev
+make
+make install


Right now the LDA tests are failing on travis because we don't have permissions to do 'make install' there. To fix this do:

Remove this make install and sudo make install.

Add to the .travis.yml file an "apt-get install liblz4-dev".

Add liblz4-dev to the cirrus/README

hyu596 and others added 30 commits July 3, 2018 00:53

Pushing all the current LDA progress

8b8b04a

Update

13d77ef

Quick update; debugging

03eb381

Switching VM; quick push

a7bd664

Update: LDA working with S3

ced887a

Clearing

c0a53c5

Add back the missing Makefile

749a58c

Fixing issue

23de928

Merge pull request #1 from jcarreira/master

8269305

Update master branch in the forked repo

Solved conflicts

a70a45d

Quick fix

353f09e

Fix formatting

1e09937

Finish travis test for lda

eb800e0

Fix formatting

b0ce881

Quick fix for formatting

99bc2b4

Quick fix the travis test & pushing the small dataset for travis test

3150227

Quick push

005163a

Fix Makefiles

7ddc346

Delete test data

bdd6191

fix conflict

d291248

Merge branch 'lda' of https://github.com/hyu596/cirrus-1 into lda

ac45905

Fix for lamda

9d48cdd

Quick fix

f64d2f1

Working on Python Interface; switch VM

b602cdc

Impoved the computation of ll

aa06b49

Writing ll to file

f1a8cd9

Quick fix for ll

0c05f6b

Clean the code

5bab83b

Clean and fix the code

4b0c691

Add improvement of ll init, benchmark, few metrics and optimizations

8c45ed8

Global var replaced with class var

767aeb5

hyu596 force-pushed the lda_opt branch 4 times, most recently from e319875 to 767aeb5 Compare January 31, 2019 22:12

hyu596 and others added 2 commits January 31, 2019 14:14

Merge pull request #7 from jcarreira/master

8f03dc7

Sync with master

Rank rework

a2ca9de

hyu596 closed this Jan 31, 2019

hyu596 reopened this Jan 31, 2019

jcarreira added 3 commits January 31, 2019 23:11

format

c7bbfb9

test for permission

2b56f44

minor change

a79bb41

hyu596 closed this Feb 14, 2019

hyu596 reopened this Feb 14, 2019

jcarreira and others added 5 commits February 14, 2019 21:37

quick push

0834662

Merge branch 'master' into lda_opt

940536d

merge master into branch

1529abb

formatting

050f2da

formatting

1a7bb6d

jcarreira requested changes Feb 24, 2019

View reviewed changes

jcarreira added 10 commits February 25, 2019 16:45

Changes required by comments

d23be4b

Changes for the comments

7c45a77

changes for the comments

8f19f22

code refac for comments

01a605c

adding lz4 installation

f09d5e1

removing make install for lz4 from bootstrap.sh

24edf1c

Formatting

6ffff5f

tworking on lz4 header

012aa64

formatting

af50d50

few changes from previous meeting

41fb741

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Lda opt #99

Lda opt #99

hyu596 commented Oct 15, 2018 •

edited by jcarreira

Loading

jcarreira left a comment

jcarreira Feb 24, 2019

jcarreira Feb 24, 2019

hyu596 Feb 25, 2019

jcarreira Feb 24, 2019

hyu596 Feb 25, 2019

jcarreira Feb 24, 2019

jcarreira Feb 24, 2019

hyu596 Feb 25, 2019

jcarreira Feb 24, 2019

hyu596 Feb 25, 2019

jcarreira Feb 24, 2019

hyu596 Feb 25, 2019

jcarreira Feb 24, 2019

hyu596 Feb 25, 2019

jcarreira Feb 24, 2019

hyu596 Feb 25, 2019 •

edited

Loading

hyu596 Feb 25, 2019

jcarreira Feb 24, 2019

hyu596 Feb 25, 2019

jcarreira Feb 24, 2019

		};

		static constexpr int WORKERS_BASE[5] = {-1, 3, -1, -1, 4};

Lda opt #99

Are you sure you want to change the base?

Lda opt #99

Conversation

hyu596 commented Oct 15, 2018 • edited by jcarreira Loading

jcarreira left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hyu596 Feb 25, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hyu596 commented Oct 15, 2018 •

edited by jcarreira

Loading

hyu596 Feb 25, 2019 •

edited

Loading