-
Notifications
You must be signed in to change notification settings - Fork 870
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
.cpm files grow uncontrollably during millions of adds and deletes #6264
Comments
@zerovian It is known disadvantage of the current cluster. I suggest you to either truncate cluster instead of delete records or partition records between clusters and remove whole clusters. |
Question: So this isn't really limited to "millions of records", right? In other words, the .cpm file will grow forever as any records are added and deleted over time (until a truncate)? If so, that's a fairly gross maintenance requirement. Are any improvements to this "known disadvantage" on the road map? |
Hi, On Thu, Jun 9, 2016 at 5:54 PM Eric Lenington [email protected]
twitter: @Andrey_Lomakin |
Does the syncing done while adding a server node in a cluster remove unused disk space caused by deletions by chance? I realize that isn't a possible solution for everyone, but those using multiple nodes/ an ODB cluster might find it useful to reclaim disk space through a node rotation (should such a growing of file data be a problem). This kind of node rotation to compact the files is possible when working with MongoDB, for example. Scott |
@zerovian @smolinari if you wish to claim space unused by .cpm files please vote for the OEP orientechnologies/orientdb-labs#7 . One of its goals "Reuse space which is used to store rid of delete record." |
Expected behavior and actual behavior
I have a database in which we create some ~million records and then delete the old ones after a few hours a few days. After a couple of days, the .pcl file size has stabilized since we end up creating and deleting the same number of records, and all the records are roughly the same size. But the .cpm file continues to grow.
Can this be fixed or worked around?
I've read the wiki which about plocal not reusing record pointer space which recommends import/export
But...dump and load is NOT an option to recover this lost space as our customers expect to stay running for more than a few days at a time. Months would be more in line with their expectations. Down time for a dump and load is bad. Due to the lost space the database starts around 1.2 GB and after some 3 days ends up at 3.5GB and continues to grow until it eats the entire file system space.
This was previously happening with the .irs files for an index and we managed to come up with a data structure that eliminated the use of the index.
Some quick (real) sample stats from a test we ran.
This is a table of data showing file sizes in kilobytes on disk on certain days.
File name | 2nd june 11:24 AM| 3rd June 11:55 AM | 6th June 2016 11:02 AM
indexstat.cpm 339841 841921 1539393
indexstat.pcl 199553 200001 200001
Steps to reproduce the problem
Create and delete a few million records continuously for a couple of days. Space is lost as .cpm file continues to grow.
Important Questions
Runninng Mode
Misc
OrientDB Version
Operating System
Java Version
The text was updated successfully, but these errors were encountered: