Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Elimination of all ByteBuffer Usage (for Java 8) #132

Open
sanastas opened this issue Feb 28, 2021 · 8 comments
Open

Elimination of all ByteBuffer Usage (for Java 8) #132

sanastas opened this issue Feb 28, 2021 · 8 comments

Comments

@sanastas
Copy link
Contributor

Base all of the off-heap memory management on unsafe allocation and access, without ByteBuffer intermediate layer (in addition to existing memory management). The reason for the change is the possibility to gain better throughput without spending CPU cycles and memory on ByteBuffer internals.

That means a new memory manager to be based on using addresses instead of ByteBuffers and allocating the address from unsafe:
Unsafe().allocateMemory(size);

That requires a significant code writing: new (alternative) Block, BlockPool, MemoryAllocator, MemoryManager etc.

For more explanations feel free to add questions in this issue.

sanastas pushed a commit that referenced this issue Apr 11, 2021
* removing the use of Bytebuffers to access native memory to improve performance

Co-authored-by: Ramy Fakhoury <[email protected]>
sanastas added a commit that referenced this issue Apr 19, 2021
…Rebase. (#156)

* MM_NEXT_STEP_01 squashed commits: Removing ValueUtils interface, making it an implementation. Creating triangle of Slices.

* Elimination of all ByteBuffer Usage (for Java 8) #132 (#151)

* removing the use of Bytebuffers to access native memory to improve performance

Co-authored-by: Ramy Fakhoury <[email protected]>

* MM_NEXT_STEP_01 squashed commits: Removing ValueUtils interface, making it an implementation. Creating triangle of Slices.

Co-authored-by: Ramy Fakhoury <[email protected]>
Co-authored-by: Ramy Fakhoury <[email protected]>
@dynaxis
Copy link
Contributor

dynaxis commented Apr 3, 2022

It doesn't seem that the doc comments on the relevant code are changed in sync with the changes. For instance, OakUnsafeDirectBuffer's getByteBuffer() returns a new ByteBuffer on every invocation of the method. But the doc comment still says that state of the returned ByteBuffer shouldn't be modified mentioning it might be shared and used elsewhere.

@sanastas
Copy link
Contributor Author

sanastas commented Apr 5, 2022

Hi @dynaxis ,

Thanks for your comment. Here is the comment/documentation about OakUnsafeDirectBuffer's getByteBuffer()

public interface OakUnsafeDirectBuffer {
    /**
     * Return the underlying memory address wrapped as a ByteBuffer.
     * This buffer might contain data that is unrelated to the context in which this object was introduced.
     * Note 1: depending on the context (casting from OakScopedReadBuffer or OakScopedWriteBuffer), the buffer mode
     *         might be ready only.
     * Note 2: the buffer internal state (e.g., byte order, position, limit and so on) should not be modified as this
     *         object might be shared and used elsewhere.
     *
     * @return the underlying memory address wrapped as a ByteBuffer.
     */
    ByteBuffer getByteBuffer();

Indeed, a new ByteBuffer object is returned on each invocation. However, the underlying memory referenced from this ByteBuffer is always the same (for the same OakUnsafeDirectBuffer object). Therefore, indeed, modifying those two ByteBuffers may lead to corrupted results. This is why the interface is named Oak-Unsafe-DirectBuffer and it needs to be used with extra care.

How would you suggest to rephrase the documentation so it is more clear?

@dynaxis
Copy link
Contributor

dynaxis commented Apr 6, 2022

Hi @sanastas,

The doc comment is a little bit misleading in the following points, IMHO:

  1. This buffer might contain data that is unrelated to the context in which this object was introduced.

    I think this is potentially misleading in that currently the returned ByteBuffer is configured to prohibit access to outside of the bounds. It should be safe to touch anywhere within the bounds, isn't it?

  2. Note 2: the buffer internal state (e.g., byte order, position, limit and so on) should not be modified as this object might be shared and used elsewhere.

    In the similar way, it's not possible to touch outside of the configured boundaries. Also the ByteBuffer instance is no longer shared among multiple threads. So it should be ok to change byte order, position, limit and so on.

So my suggestion for a better wording is as follows:

public interface OakUnsafeDirectBuffer {
    /**
     * Returns a ByteBuffer wrapping the underlying memory segment backing this buffer.
     * 
     * Since a new ByteBuffer instance is returned on each invocation of this method, it's absolutely safe
     * to modify the state of the ByteBuffer including its byte order, position, and limit.
     * 
     * Note that the buffer mode of the returned ByteBuffer might be read-only, 
     * depending on whether this buffer has been casted from OakScopedReadBuffer or OakScopedWriteBuffer.
     *
     * @return a ByteBuffer wrapping the underlying memory segment
     */
    ByteBuffer getByteBuffer();

UPDATE: I also found that the ByteBuffer is never read-only different from the note 2 in the current doc comment. So should we remove the last sentence starting with Note that the buffer... or make the ByteBuffers returned from read only buffers actually read-only?

In my opinion, ByteBuffers returned should be set read-only if they are from read-only buffer.

@sanastas
Copy link
Contributor Author

sanastas commented Apr 7, 2022

Hi @dynaxis ,

Thank you for your reply. Your suggestion makes sense. Although I would stress that this buffers brings you no protection from possible concurrent update of the same underlying memory from different ByteBuffer objects. If you are working with single thread then no problem, of course.

public interface OakUnsafeDirectBuffer {
    /**
     * Returns a ByteBuffer wrapping the underlying memory segment backing this buffer. 
     * NOT THREAD SAFE! Two ByteBuffers of the same OakUnsafeDirectBuffer will reference to the same memory, 
     * so concurrent access with at least one write may result in inconsistent state of the data. 
     * 
     * Since a new ByteBuffer instance is returned on each invocation of this method, it's absolutely safe
     * to modify the state of the ByteBuffer including its byte order, position, and limit.
     * 
     * Note that the buffer mode of the returned ByteBuffer might be read-only, 
     * depending on whether this buffer has been casted from OakScopedReadBuffer or OakScopedWriteBuffer.
     *
     * @return a ByteBuffer wrapping the underlying memory segment
     */
    ByteBuffer getByteBuffer();

As for read-only buffer, while creating the buffer inside DirectUtils's wrapAddress we need to check for instance of OakScopedWriteBuffer and only then let the ByteBuffer be writable, otherwise asReadOnlyBuffer().

@dynaxis , as the change is quite easy, would you be able to provide a PR, so I will review and merge to master? So you will join our small but powerful contributors community? :)

@dynaxis
Copy link
Contributor

dynaxis commented Apr 8, 2022

Dear @sanastas,

I agree with your additions to the doc comment, though I might suggest a bit of rewording. Also it's good to issue a PR myself.
BTW, I think the check for read only buffer should be inside getByteBuffer() method rather than DirectUtils.wrapAddress(). Otherwise DirectUtils will have unnecessary tight coupling with buffer types.

Anyway, I'll prepare a PR soon and come back.

@sanastas
Copy link
Contributor Author

sanastas commented Apr 8, 2022

Thank you very much @dynaxis , do as you think is the best way! Very much appreciated!

@dynaxis
Copy link
Contributor

dynaxis commented Apr 8, 2022

@sanastas #209 is issued to resolve the issues I raised above.

@sanastas
Copy link
Contributor Author

sanastas commented Apr 9, 2022

@dynaxis , thank you so much! Review done. Code looks good, just one comment left!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants