Neighbour analysis for the entire trajectory #251

sudarsan-surendralal · 2021-06-27T07:53:13Z

The idea is to easily perform a neighbor analysis over the trajectory of an atomistic simulation. This is something I've been doing in my notebooks for some time which could be useful to others as well. There are also ways to make this more efficient. So please feel free to suggest improvements!

coveralls · 2021-06-27T08:01:08Z

Pull Request Test Coverage Report for Build 1045104153

65 of 67 (97.01%) changed or added relevant lines in 2 files are covered.
No unchanged relevant lines lost coverage.
Overall coverage increased (+0.1%) to 68.031%

Changes Missing Coverage	Covered Lines	Changed/Added Lines	%
pyiron_atomistics/atomistics/job/atomistic.py	26	27	96.3%
pyiron_atomistics/atomistics/structure/neighbors.py	39	40	97.5%

Totals
Change from base Build 1037555151:	0.1%
Covered Lines:	10889
Relevant Lines:	16006

💛 - Coveralls

pyiron_atomistics/atomistics/job/atomistic.py

samwaseda · 2021-06-27T08:38:45Z

I see that the results are stored in private variables. How are they supposed to be used?

User friendly functions to get the neighbors for the required part of the trajetory

sudarsan-surendralal · 2021-06-28T08:14:52Z

I see that the results are stored in private variables. How are they supposed to be used?

I've updated the structure of the class! This can now be accessed by

neighbors = job.get_neighbors()
neighbors.neighbor_vectors

samwaseda · 2021-06-28T08:22:38Z

Currently it's mostly equivalent to [job.get_structure(i).get_neighbors() for i in range(given_range)], but I presume that this class is going to be extended to do more physics in the future, right? Can you tell (roughly, no detail needed) how you use it (i.e. what kind of physical quantities do you usually extract in your work)? Depending on the purpose I might have different feedback.

sudarsan-surendralal · 2021-06-28T08:32:30Z

Currently it's mostly equivalent to [job.get_structure(i).get_neighbors() for i in range(given_range)],

In principle, yes. Such code is trivial but I have to repeat this in pretty much all my notebooks. Options to do this only for certain parts of the trajectory is also pretty useful for me

but I presume that this class is going to be extended to do more physics in the future, right? Can you tell (roughly, no detail needed) how you use it (i.e. what kind of physical quantities do you usually extract in your work)? Depending on the purpose I might have different feedback.

I'll mostly use this to analyze bond breaking/formation for molecules (reactivity), species sensitive radial distribution functions and so on!

pyiron_atomistics/atomistics/job/atomistic.py

pmrv · 2021-07-02T09:00:57Z

pyiron_atomistics/atomistics/structure/neighbors.py

+
+    """
+
+    def __init__(self, init_structure=None, positions=None, cells=None, num_neighbors=12, **kwargs):


I think it'd be extremely useful for this to take any object that implements HasStructure instead of passing all of these things separately. Calling get_structure every step might be a bit slower, but since you're doing essentially what AtomisticGenericJob.get_structure does anyway, it shouldn't be that much.

If it turns out much slower, then I'd say we add a method roughly like this

@classmethod def from_structures(cls, has_structure): structs = list(has_structure.iter_structures()) return cls(structs[0], [s.positions for s in structs], [s.cell for s in structs])

to NeighborTraj to achieve the same generality, but offer a small escape hatch for cases where we know get_structure is too slow.

If it turns out much slower, then I'd say we add a method roughly like this

@classmethod def from_structures(cls, has_structure): structs = list(has_structure.iter_structures()) return cls(structs[0], [s.positions for s in structs], [s.cell for s in structs])

to NeighborTraj to achieve the same generality, but offer a small escape hatch for cases where we know get_structure is too slow.

I think this is a great way to generalize this for any derivative of HasStructure. However, I don't think building the positions and cells from the individual structures is not that efficient.

So how about something like this:

def __init__(self, has_structure=None, init_struct=None, positions=None, cells=None): if has_structure is None: if positions is None or init_struct is None: raise ValueError()

and then

def _get_neighbors_hs(has_struct, num_neighbors=20, **kwargs): [struct.get_neighbors() for struct in has_struct.iter_structures()]

which gets called instead of _get_neighbors

I think we should simply give it a try to use get_structure() everywhere and see if it really is slower. Checking Trajectory._get_structure and your code in _get_neighbors() looks essentially the same to me, so I don't expect a big hit.

If you don't want to do it, I can give it a go tonight.

Also __init__ needs to accept an init argument as the first argument and pass it to super().__init__(), otherwise recursive loading from HDF will not work.

I think we should simply give it a try to use get_structure() everywhere and see if it really is slower. Checking Trajectory._get_structure and your code in _get_neighbors() looks essentially the same to me, so I don't expect a big hit.

If you don't want to do it, I can give it a go tonight.

Sure I'll merge your PR into this one and see what happens!

Also __init__ needs to accept an init argument as the first argument and pass it to super().__init__(), otherwise recursive loading from HDF will not work.

Not sure what you mean by this. Could you clarify?

Also __init__ needs to accept an init argument as the first argument and pass it to super().__init__(), otherwise recursive loading from HDF will not work.

Not sure what you mean by this. Could you clarify?

When you derive from DataContainer the new __init__ method must be compatible to the original one. One of the ways DataContainer is instantiated is by passing a collection to initialize it, like so

d = DataContainer({'a': 1, 'b': 3, 2: 42}) assert d[1] == 3

Because this is used by the DataContainer internally when loading from hdf, subclasses also need to offer this. There's a little bit more explanation here in the attention box.

# Conflicts: # pyiron_atomistics/atomistics/job/atomistic.py

Co-authored-by: Marvin Poul <[email protected]>

pmrv · 2021-07-07T06:11:21Z

pyiron_atomistics/atomistics/structure/neighbors.py

+    def __init__(self, init_structure=None, positions=None, cells=None, num_neighbors=12, **kwargs):
+        """
+
+        Args:
+            init_structure (pyiron_atomistics.atomistics.structure.atoms.Atoms): Any given structure of the trajectory
+            positions (numpy.ndarray): The cartesian positions of the trajectories
+            cells (numpy.ndarray/None): The varying cell shapes
+            num_neighbors (int): The cutoff for the number of neighbors
+            **kwargs (dict): Additional arguments to be passed to the `get_neighbors()` routine
+                             (eg. cutoff_radius, norm_order , etc.)
+        """
+        self._init_structure = init_structure
+        self._neighbor_indices = None
+        self._neighbor_distances = None
+        self._neighbor_vectors = None
+        self._positions = positions
+        self._cells = cells
+        self._num_neighbors = num_neighbors
+        self._get_neighbors_kwargs = kwargs


For __init__ to be compatible a change like this would be necessary. When init is given the other parameters are not set, because presumably they are set in the init read from HDF5. If you want to be complete, you can check that init really contains all the attributes that __init__ normally sets and set those manually that are not in `init.

Suggested change

def __init__(self, init_structure=None, positions=None, cells=None, num_neighbors=12, **kwargs):

"""

Args:

init_structure (pyiron_atomistics.atomistics.structure.atoms.Atoms): Any given structure of the trajectory

positions (numpy.ndarray): The cartesian positions of the trajectories

cells (numpy.ndarray/None): The varying cell shapes

num_neighbors (int): The cutoff for the number of neighbors

**kwargs (dict): Additional arguments to be passed to the `get_neighbors()` routine

(eg. cutoff_radius, norm_order , etc.)

"""

self._init_structure = init_structure

self._neighbor_indices = None

self._neighbor_distances = None

self._neighbor_vectors = None

self._positions = positions

self._cells = cells

self._num_neighbors = num_neighbors

self._get_neighbors_kwargs = kwargs

def __init__(self, init=None, init_structure=None, positions=None, cells=None, num_neighbors=12, **kwargs):

"""

Args:

init_structure (pyiron_atomistics.atomistics.structure.atoms.Atoms): Any given structure of the trajectory

positions (numpy.ndarray): The cartesian positions of the trajectories

cells (numpy.ndarray/None): The varying cell shapes

num_neighbors (int): The cutoff for the number of neighbors

**kwargs (dict): Additional arguments to be passed to the `get_neighbors()` routine

(eg. cutoff_radius, norm_order , etc.)

"""

super().__init__(init=init)

if init is None:

self._init_structure = init_structure

self._neighbor_indices = None

self._neighbor_distances = None

self._neighbor_vectors = None

self._positions = positions

self._cells = cells

self._num_neighbors = num_neighbors

self._get_neighbors_kwargs = kwargs

# Conflicts: # pyiron_atomistics/atomistics/structure/neighbors.py

sudarsan-surendralal · 2021-07-15T07:15:22Z

Since a HasStructure instance is now an attribute of the neighbor class, calling to_hdf now tries to write the entire trajectory (or structure container) again in the hdf5 file. I think this is a bit redundant!

pyiron_atomistics/atomistics/structure/neighbors.py

pmrv · 2021-07-16T07:45:36Z

@samwaseda & @sudarsan-surendralal I just saw that the attribute names between Neighbors and NeighborsTraj are not consistent. I don't have a preference either way, but I think they should be the same, otherwise it'll be confusing to use.

pyiron_atomistics/atomistics/structure/neighbors.py

Co-authored-by: Marvin Poul <[email protected]>

…mistics into neighbors_traj

# Conflicts: # pyiron_atomistics/atomistics/structure/neighbors.py

⚡ Neighbour analysis for the entire trajectory

f7648be

sudarsan-surendralal added the enhancement New feature or request label Jun 27, 2021

sudarsan-surendralal marked this pull request as draft June 27, 2021 07:53

sudarsan-surendralal added 2 commits June 27, 2021 10:10

pep8

f8f2a95

🐛 Set the correct quantity

bb8b781

samwaseda reviewed Jun 27, 2021

View reviewed changes

pyiron_atomistics/atomistics/job/atomistic.py Outdated Show resolved Hide resolved

sudarsan-surendralal added 11 commits June 27, 2021 10:46

Reduce lines

84c82ac

Make the attributes available as properties

2308729

Move the functions to a new class NeighborTraj

b6456c1

Implement functions

aa5f24f

User friendly functions to get the neighbors for the required part of the trajetory

Important fixes

02611d1

Fix and data type conversion!

6bc829b

Introducing tests

4e1cb35

Improving docstrings

cf219f8

pep8

3341f0d

More functions in the job class!

cc0ec61

Updating tests

80cb83e

sudarsan-surendralal marked this pull request as ready for review June 28, 2021 08:14

sudarsan-surendralal requested review from pmrv and liamhuber June 28, 2021 08:15

Add option to store and retrieve from HDF5 files

9614ef8

pmrv reviewed Jul 2, 2021

View reviewed changes

pyiron_atomistics/atomistics/job/atomistic.py Show resolved Hide resolved

pmrv reviewed Jul 2, 2021

View reviewed changes

pmrv mentioned this pull request Jul 2, 2021

Add HasStructure to Trajectory #270

Merged

Merge remote-tracking branch 'origin/master' into neighbors_traj

7462d71

# Conflicts: # pyiron_atomistics/atomistics/job/atomistic.py

Update pyiron_atomistics/atomistics/structure/neighbors.py

2e12c1e

Co-authored-by: Marvin Poul <[email protected]>

pmrv reviewed Jul 7, 2021

View reviewed changes

sudarsan-surendralal and others added 5 commits July 7, 2021 22:35

Merge remote-tracking branch 'origin/master' into neighbors_traj

5f407a2

# Conflicts: # pyiron_atomistics/atomistics/structure/neighbors.py

Set table name and rely on base class for writing to hdf

3d9792e

Define index variable

e5cfc43

Fix typo

b278379

Merge remote-tracking branch 'origin/nj_has' into neighbors_traj

d9cc947

# Conflicts: # pyiron_atomistics/atomistics/structure/neighbors.py

pmrv self-requested a review July 14, 2021 12:44

pmrv approved these changes Jul 14, 2021

View reviewed changes

sudarsan-surendralal added 2 commits July 14, 2021 22:57

Implement __getitem__ to clice trajectories

9acaa44

Disable loading checks!

7c00690

cleanup!

02413e0

pmrv reviewed Jul 15, 2021

View reviewed changes

pyiron_atomistics/atomistics/structure/neighbors.py Show resolved Hide resolved

pmrv reviewed Jul 15, 2021

View reviewed changes

pyiron_atomistics/atomistics/structure/neighbors.py Outdated Show resolved Hide resolved

pmrv reviewed Jul 16, 2021

View reviewed changes

pyiron_atomistics/atomistics/structure/neighbors.py Outdated Show resolved Hide resolved

sudarsan-surendralal and others added 11 commits July 16, 2021 14:34

Update pyiron_atomistics/atomistics/structure/neighbors.py

ecad9c0

Co-authored-by: Marvin Poul <[email protected]>

Update pyiron_atomistics/atomistics/structure/neighbors.py

4aa7856

Co-authored-by: Marvin Poul <[email protected]>

minor changes

7f23b96

Merge branch 'neighbors_traj' of https://github.com/pyiron/pyiron_ato…

ed5e4f5

…mistics into neighbors_traj

🐛 Fix constructor

c6b9fed

Restoring some checks

ae725e5

Refactoring class name and attributes (Marvin's suggestion)

1b74ecc

Remove unused import

62e39a4

Merge remote-tracking branch 'origin/master' into neighbors_traj

6d51b09

Merge remote-tracking branch 'origin/master' into neighbors_traj

853b376

# Conflicts: # pyiron_atomistics/atomistics/structure/neighbors.py

Merging new neighbor function changes

2d507db

sudarsan-surendralal merged commit e954f09 into master Jul 19, 2021

delete-merged-branch bot deleted the neighbors_traj branch July 19, 2021 12:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Neighbour analysis for the entire trajectory #251

Neighbour analysis for the entire trajectory #251

sudarsan-surendralal commented Jun 27, 2021

coveralls commented Jun 27, 2021 •

edited

Loading

samwaseda commented Jun 27, 2021

sudarsan-surendralal commented Jun 28, 2021

samwaseda commented Jun 28, 2021

sudarsan-surendralal commented Jun 28, 2021

pmrv Jul 2, 2021

pmrv Jul 2, 2021

sudarsan-surendralal Jul 2, 2021

pmrv Jul 6, 2021

pmrv Jul 6, 2021

sudarsan-surendralal Jul 6, 2021

sudarsan-surendralal Jul 6, 2021

pmrv Jul 7, 2021

pmrv Jul 7, 2021

sudarsan-surendralal commented Jul 15, 2021

pmrv commented Jul 16, 2021


		"""

		def __init__(self, init_structure=None, positions=None, cells=None, num_neighbors=12, **kwargs):

Neighbour analysis for the entire trajectory #251

Neighbour analysis for the entire trajectory #251

Conversation

sudarsan-surendralal commented Jun 27, 2021

coveralls commented Jun 27, 2021 • edited Loading

Pull Request Test Coverage Report for Build 1045104153

💛 - Coveralls

samwaseda commented Jun 27, 2021

sudarsan-surendralal commented Jun 28, 2021

samwaseda commented Jun 28, 2021

sudarsan-surendralal commented Jun 28, 2021

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sudarsan-surendralal commented Jul 15, 2021

pmrv commented Jul 16, 2021

coveralls commented Jun 27, 2021 •

edited

Loading