SQL Query That Returns The Names of the Clusters for a Class #6152

SDIPro · 2016-05-17T16:57:58Z

We need a SQL query that can return the names of the clusters that are associated with a class.

rdelangh · 2016-07-27T10:47:45Z

+1
I would require this, to identify the clusters which

have not yet been backed up
contain old data that can be trashed (given appropriate naming conventions of my self-created clusters)

lvca · 2016-07-28T15:01:24Z

@rdelangh what do you mean with both your points? Could you please explain them better?

rdelangh · 2016-07-28T16:26:24Z

@lvca
The BACKUP DATABASE mechanism (= full backup, not realistic for huge dbases), or the discussed incremental backup mechanism (not our requirement, we wish to backup specific data-ranges of records) do not allow to keep tight control over which data is actually in need of backup, and which backup-file would contain data that must be restored after e.g. erroneous DELETE.

So, what I hope to figure out in the coming days is a way that resembles very much traditional databases' partitioning mechanisms, where

you upfront create partitions (here: clusters) with a name that reflects some date-range (ie "mycluster_20160728" for records of 28th July 2016), then intelligently insert records in the appropriate cluster, by their date
selecting records for a given date, translates into a SELECT ... FROM cluster:mycluster_20160728 (or similar)
backing up records for a given date, can (hopefully) be done with an EXPORT of the corresponding specific cluster only; similarly to restore the contents of such cluster
cleanup of old, obsoleted data, is a DROP CLUSTER xyz thing, very neat and fast

lvca · 2016-07-28T16:28:55Z

Got it.

lvca · 2016-07-28T16:30:01Z

In order to do this, we need cluster level indexes too.

rdelangh · 2016-07-28T18:09:51Z

Luca,
Andrey commented (orientechnologies/orientdb-labs#7 (comment)) in another subject that

indexes are automatically removed for records which are in a cluster that gets DROPped
the free space in the index files for such removed indexes are re-used

-> do you mean that this mechanism is not sufficient to support cluster-based INSERT, cluster-based EXPORT, and DROP of clusters ?
-> do you suggest that the indexes might become unusable/inconsistent (with a need to rebuild them, a very lengthy process when there is 30 billion records in the class) when working this way?

luigidellaquila · 2018-06-22T15:32:08Z

For cluster IDs in a class you can use

select expand(classes) from metadata:schema

For clusters in the server you can use

select expand(clusters) from metadata:storage

Then you can mix them to get info about clusters in a single class:

select from (
  select expand(clusters) from metadata:storage
) WHERE id in (select classes[name="YourClassName"][0].clusterIds from metadata:schema)

(tested in v 3.0)

Thanks

Luigi

rdelangh · 2018-06-22T21:58:08Z

hi @luigidellaquila
before we can start to use v3, could you please help to fix our issue with Lucene indexes on multiple-fields, in v3 : #8323 ?

rdelangh · 2018-06-22T22:01:27Z

in v2.x, there is no such thing as "metadata:storage", to query the cluster-ids from, or is there another place to look for cluster details?

luigidellaquila · 2018-06-25T07:02:24Z

Hi @rdelangh

In v 2.x there is no way to retrieve the clusters from SQL. You have to do it via API.

eg. in Java:

OClass clazz = db.getMetadata().getSchema().getClass("className");
clazz.getClusterIds();

...

db.getClusterNameById(id);

Thanks

Luigi

SDIPro assigned luigidellaquila May 17, 2016

SDIPro added enhancement sql labels May 17, 2016

lvca added the waiting reply label Jul 28, 2016

luigidellaquila removed the waiting reply label Oct 28, 2016

lvca added this to the 3.0 milestone Aug 4, 2017

lvca modified the milestones: 3.0.0, 3.0.x Apr 9, 2018

luigidellaquila closed this as completed Jun 22, 2018

luigidellaquila removed this from the 3.0.x milestone Jul 10, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SQL Query That Returns The Names of the Clusters for a Class #6152

SQL Query That Returns The Names of the Clusters for a Class #6152

SDIPro commented May 17, 2016

rdelangh commented Jul 27, 2016

lvca commented Jul 28, 2016

rdelangh commented Jul 28, 2016

lvca commented Jul 28, 2016

lvca commented Jul 28, 2016

rdelangh commented Jul 28, 2016

luigidellaquila commented Jun 22, 2018

rdelangh commented Jun 22, 2018

rdelangh commented Jun 22, 2018

luigidellaquila commented Jun 25, 2018

SQL Query That Returns The Names of the Clusters for a Class #6152

SQL Query That Returns The Names of the Clusters for a Class #6152

Comments

SDIPro commented May 17, 2016

rdelangh commented Jul 27, 2016

lvca commented Jul 28, 2016

rdelangh commented Jul 28, 2016

lvca commented Jul 28, 2016

lvca commented Jul 28, 2016

rdelangh commented Jul 28, 2016

luigidellaquila commented Jun 22, 2018

rdelangh commented Jun 22, 2018

rdelangh commented Jun 22, 2018

luigidellaquila commented Jun 25, 2018