Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Reindex MCP Prod ATM registry index with corrected mappings #339

Open
alexdunnjpl opened this issue Oct 23, 2024 · 11 comments
Open

Reindex MCP Prod ATM registry index with corrected mappings #339

alexdunnjpl opened this issue Oct 23, 2024 · 11 comments

Comments

@alexdunnjpl
Copy link
Contributor

💡 Description

The reindexing sweeper picked up some properties which aren't described by the DD and aren't desirable to fix (or at least, aren't desirable to map as keyword).

These will need to be removed from the mapping, and a new index created with that mapping, migrated from the existing index.

@sjoshi-jpl to discuss with @alexdunnjpl upon his return

⚔️ Parent Epic / Related Tickets

No response

@alexdunnjpl
Copy link
Contributor Author

@jordanpadams per @sjoshi-jpl , the logstash instance was nuked in the past 24hrs, and redeploying/reconfiguring will be a significant (1wk) amount of work.

Given that fact, it seems like the best option available to us may be to manually (i.e. python script) paginate/insert into a new index, run whatever confirmation checks we deem necessary, then delete/rename.

My only concern is ensuring consistency/completeness of the copied data (as was a factor in the pubcloud->mcp migration)

Any thoughts on that?

@tloubrieu-jpl
Copy link
Member

We will use logstash for that work.

@alexdunnjpl
Copy link
Contributor Author

@alexdunnjpl and @sjoshi-jpl to collaborate on this task

@jordanpadams jordanpadams transferred this issue from NASA-PDS/planetary-data-cloud Nov 11, 2024
@tloubrieu-jpl
Copy link
Member

@sjoshi-jpl is preparing the re-indexation with logstash.

@alexdunnjpl
Copy link
Contributor Author

Status: logstash does not support AOSS as an input so logstash migration not possible. @alexdunnjpl to develop reusable tool for this purpose

@tloubrieu-jpl
Copy link
Member

The code goes to registry-operation

@alexdunnjpl
Copy link
Contributor Author

alexdunnjpl commented Dec 4, 2024

Partially implemented, currently blocked pending availability of a fresh temporary index

@sjoshi-jpl see slack for details of request

@alexdunnjpl
Copy link
Contributor Author

Status: implemented/tested consistency check.

Awaiting @sjoshi-jpl for one last new index, then I can do a final migration and we can close this out

@jordanpadams
Copy link
Member

Status: Running running. Keeps running running.

@alexdunnjpl
Copy link
Contributor Author

Status: No longer running - AOSS fails repeatably under load, preventing reindexing sweeper completion.

See #329

@sjoshi-jpl investigating AOSS metrics

@jordanpadams
Copy link
Member

Blocked by #329

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
Status: ToDo
Development

No branches or pull requests

4 participants