
IO: Replaced factories.array() with DNDarray #951

Closed · wants to merge 6 commits

Conversation

@shahpratham (Collaborator)

Description

Replaced factories.array() with DNDarray in io.py.
Issue(s) resolved: #797

Changes proposed:

  • factories.array -> dndarray.DNDarray

Type of change

enhancement

Due Diligence

  • All split configurations tested
  • Multiple dtypes tested in relevant functions
  • Documentation updated (if needed)
  • Updated changelog.md under the title "Pending Additions"

Does this change modify the behaviour of other functions? If so, which?

no

@mtar (Collaborator) commented Apr 2, 2022

GPU cluster tests are currently disabled on this Pull Request.

@ghost commented Apr 2, 2022

CodeSee Review Map: review these changes using an interactive CodeSee Map.

@shahpratham (Collaborator, Author)

@ClaudiaComito @mtar Please let me know if any changes are required.

@ClaudiaComito (Contributor) left a comment

Hey @shahpratham, thanks a lot for jumping in, and well done: the use cases are correct, and we want to save ourselves the communication overhead that factories.array() incurs when is_split is not None.

However, gshape is the global shape of the DNDarray, and you cannot derive balanced from the process-local torch tensor. More in the comments below.
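For context, a rough sketch of the two construction paths (this is not the io.py code itself; the DNDarray keyword set follows the diff below, the rest is an assumption):

```python
# Rough sketch contrasting factories.array() with direct DNDarray
# construction. Run under MPI; keyword names follow the diff in this PR.
import torch
import heat as ht
from heat.core.dndarray import DNDarray

comm = ht.communication.MPI_WORLD

# Each process holds a local slice of the (conceptual) global array.
local_tensor = torch.arange(4, dtype=torch.int64).reshape(2, 2)

# Path 1: factories.array() with is_split gathers the local shapes from
# all processes to reconstruct the global shape -> communication.
a = ht.array(local_tensor, is_split=0, comm=comm)

# Path 2: direct construction skips that round trip, but only works if
# the caller already knows the global shape, e.g. from file metadata.
b = DNDarray(
    local_tensor,
    gshape=(2 * comm.size, 2),  # assumed known without communication
    dtype=ht.int64,
    split=0,
    device=a.device,
    comm=comm,
    balanced=True,
)
```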

heat/core/io.py (outdated)

```diff
-    local_tensor, dtype=dtype, is_split=0, device=device, comm=comm
+resulting_tensor = DNDarray(
+    local_tensor,
+    gshape=tuple(local_tensor.shape),
```
@ClaudiaComito (Contributor)

gshape is supposed to be the global shape of the memory-distributed array resulting_tensor. Here you're setting it to the shape of local_tensor, which is basically a slice of the global array.

If we have all the information we need to calculate gshape without communication among processes, then we can call DNDarray(...); otherwise we need to use factories.array(), which takes care of the communication.
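As a rough sketch of the "no communication needed" case (load_hdf5_sketch is a hypothetical helper, not heat's actual load_hdf5): for self-describing formats such as HDF5, every process can read the global shape from the file metadata locally.

```python
# Hypothetical sketch: with self-describing file formats the global
# shape comes from metadata, so DNDarray(...) can be built without
# any inter-process communication.
import h5py
import torch
from heat.core.dndarray import DNDarray

def load_hdf5_sketch(path, dataset, dtype, split, device, comm):
    with h5py.File(path, "r") as f:
        gshape = tuple(f[dataset].shape)           # global shape, no MPI traffic
        _, _, indices = comm.chunk(gshape, split)  # this process's slice
        local_tensor = torch.tensor(f[dataset][indices])
    return DNDarray(
        local_tensor,
        gshape=gshape,
        dtype=dtype,
        split=split,
        device=device,
        comm=comm,
        balanced=True,  # assumption: comm.chunk() hands out even chunks
    )
```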

@shahpratham (Collaborator, Author)

Okay, understood. So can I make a copy of that tensor before slicing it and pass gshape=tuple(local_tensor_copy.shape) to it?

heat/core/io.py (outdated)

```diff
+    split=0,
+    device=device,
+    comm=comm,
+    balanced=local_tensor.is_balanced,
```
@ClaudiaComito (Contributor)

local_tensor is a torch.Tensor. It doesn't "know" about being a slice of a larger distributed array.

is_balanced() is a method of the DNDarray class; it indicates whether the memory-distributed DNDarray is distributed evenly among the available processes.

If we cannot assess the load balance of the output DNDarray (I'm not sure that's the case here, I haven't checked), we set balanced = None.
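A quick way to see the distinction (a minimal sketch using heat's public API; larray is the process-local torch tensor):

```python
# Minimal sketch: balance is a property of the distributed DNDarray,
# never of a process-local torch.Tensor.
import heat as ht

x = ht.zeros((10, 4), split=0)  # distributed row-wise
print(x.is_balanced())          # True if chunks are spread evenly
print(type(x.larray))           # torch.Tensor: has no notion of balance
```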

heat/core/io.py (outdated)

```diff
-resulting_tensor = factories.array(data, dtype=dtype, is_split=1, device=device, comm=comm)
+resulting_tensor = DNDarray(
+    data,
+    gshape=tuple(data.shape),
```
@ClaudiaComito (Contributor)

see above

heat/core/io.py (outdated)

```diff
+    split=1,
+    device=device,
+    comm=comm,
+    balanced=data.is_balanced,
```
@ClaudiaComito (Contributor)

see above

@shahpratham (Collaborator, Author)

Hey @ClaudiaComito, I have made some changes, kindly review.

```diff
-    local_tensor, dtype=dtype, is_split=0, device=device, comm=comm
+resulting_tensor = DNDarray(
+    local_tensor,
+    gshape=local_shape,
```
@ClaudiaComito (Contributor)

Hi @shahpratham, gshape is the global shape, not the local shape.
This function reads an array that might be, say, (1 billion x 1000) in size (just making the size up). The data will be distributed across many processes, i.e. each process will read only a specific subset of rows (if split=0) or columns (if split=1) out of that file and store them in local_tensor.

So local_tensor is the process-local slice of the data. Its shape, local_shape, may vary (depending on the number of processes, for example; see communication.chunk()). But the global shape gshape will always be (1 billion x 1000).
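A toy illustration of that chunking (sizes scaled down; a sketch assuming heat's communication.chunk() API, run under MPI with e.g. mpirun -n 4):

```python
# Toy sketch of global vs. local shape: comm.chunk() derives each
# process's offset, local shape and slices from the *global* shape.
import heat as ht

comm = ht.communication.MPI_WORLD
gshape = (1_000, 8)                              # global shape, fixed
offset, lshape, _ = comm.chunk(gshape, split=0)  # row-wise split
# On 4 processes every rank reports lshape == (250, 8),
# while gshape stays (1_000, 8) everywhere.
print(comm.rank, offset, lshape)
```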

@shahpratham (Collaborator, Author)

> But the global shape gshape will always be (1 billion x 1000).

Yes, so I need to get the global shape of the csv file after reading it, like how it's done in the load_hdf5 function (on line 121) and in the load_netcdf function (on line 334), right?
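For the CSV case there is no shape metadata in the file, so the global row count has to be established first. One possible way (a hypothetical sketch, not heat's actual load_csv logic, and csv_gshape is an invented helper) would be a single Allreduce over the process-local row counts:

```python
# Hypothetical sketch: a CSV file carries no shape metadata, so the
# global number of rows is derived from the local row counts. This
# still costs one collective, which is why factories.array() can be
# the simpler choice when gshape is not known up front.
from mpi4py import MPI

def csv_gshape(local_tensor, comm, n_columns):
    # every process contributes the rows parsed from its byte range;
    # comm forwards lowercase calls to the underlying mpi4py communicator
    global_rows = comm.allreduce(local_tensor.shape[0], op=MPI.SUM)
    return (global_rows, n_columns)
```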

@ClaudiaComito (Contributor)

@shahpratham now you know everything about Heat - let's merge this PR, can you update? Thanks!

@shahpratham (Collaborator, Author)

Yes, sorry, I forgot about this.
I have created a new PR, #1089, due to some issue with my previous branch.

@ClaudiaComito added this to the Repo Clean-Up milestone on Jul 31, 2023
@ClaudiaComito (Contributor)

This is addressed in #1089, closing

Successfully merging this pull request may close these issues:

  • Module io: replace factories.array() with DNDarray construct where possible (#797)