-
Notifications
You must be signed in to change notification settings - Fork 493
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Indexing: ability to load balance indexing tasks across multiple app (Glassfish) servers #1757
Comments
Query in sql to get only the modular values: It does seem like EJBQL supports this with MOD( 7, 3 ) = 1, so we probably don't have to write is as a native query. |
As of 317bcbf "index all" can be load balanced across multiple Glassfish servers. I provided a script that explains how to select the number of partitions (i.e. number of Glassfish servers) and for the given server, which partitionId should be run. Here's the script: https://github.com/IQSS/dataverse/blob/master/scripts/search/index Here's some example output: curl 'http://localhost:8080/api/admin/index?numPartitions=2&partitionIdToProcess=0&previewOnly=true'
curl 'http://localhost:8080/api/admin/index?numPartitions=2&partitionIdToProcess=0'
Passing to QA. Heads up to @scolapasta and @landreev and @ekraffmiller |
Since this issue is still open I figure it's fair game to leave a comment here. :) I just wanted to note that in 772813e @scolapasta added a "continue" endpoint which I haven't used personally but is a way to pick up where you left off. @kcondon and I discussed how much of this we should document in the guides and we're not at all sure that these endpoints should be documented and encouraged in their current form. Perhaps we need a new issue about defining what we want, cleaning up those endpoints and documenting them. It would be good to look at the Dataverse 4.0 Search Index Functional Requirements Document if we do. There are a variety of index issues still open and not in QA such as #50 #702 #1408 #1749 #2279. |
@scolapasta @pdurbin The current mod implementation has a continue flag and is intended to continue but it doesn't seem to work. It may have to do with it not updating indextime in the db as it indexes so there is not reference point to continue from. |
@scolapasta and I have been talking about adding the ability to load balance indexing tasks across multiple app (Glassfish) servers in person and in the Dataverse 4.0 Search Index Functional Requirements Document doc.
Imagine if you had three Glassfish servers and each one does part of "index all".
I stubbed out some code and tests at c4fb067 but we need to keep working on this to make it real.
@scolapasta please advise on when this work should continue and who should do it.
The text was updated successfully, but these errors were encountered: