-
Notifications
You must be signed in to change notification settings - Fork 128
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
POSIX Transport creates file on Open() without Flush() #1765
Comments
Hi @philip-davis I assume you're referring to the streaming with files case. At the user-level reading should be done after a writer EndStep, but you point out there is a "gray" area (0 bytes) between Open and right before EndStep. Am I understanding the issue correctly? If it is either the reader might have the appropriate logic, or the initial buffer must have a flag to say it exist but nothing was written yet. |
Hi @williamfgc, yes this is for streaming. The gray area is the between Open and the writing of the header of the metadata index file (which contains things like the ADIOS version of the writer, and which is only written once.) This leads to a code path where the reader could misinterpret this header as the index for a new step. |
The write happens only a few lines after the Open in InitTransports(), so it's not a very long time. |
@philip-davis thanks for the clarification. My first guess is that the reader would look for the appropriate index files flags and throw an exception if things are not as expected. As you said, it would be a matter of testing to look for race conditions (even if it's not a very long time). |
Hi @philip-davis, I'm not sure if I understand your question correctly, but I think the flush is called every time the metadata index file is written. Just take a look at the following code from line 530 to 533 in BP4Writer.cpp:
What kind of problem did you run into? It's on the writer side or reader side? |
@philip-davis I see what you are saying. It's actually the code from line 215 to 233:
Basically, you are saying between "OpenFiles" and "FlushFiles", if the reader tries to read the metadata index file, error will occur since it is empty, right? I think Norbert probably added some code on the reader side that handles this situation. But I need to double check with him. |
I'm not sure if I'm actually seeing this cause a problem, I just noticed this when I was trying to figure out what was going on with #1745. The issue is with the open:
As soon as that happens, a zero length md0.idx is visible on the file system (even though there hasn't been a Flush yet.) Once that file exists, OpenFiles() can succeed on the read side, since the idx file and metadata files can be opened without an exception being thrown. Since the idx file is empty, m_MDIndexFileProcessedSize will be set to zero. Later, when the header is written, in BP4Reader::UpdateBuffer the reader will see that the idx file is larger and wrongly process that as a new timestep. |
Yes, that's right, except that it's the write that returns the situation to
safety since the flush does exactly nothing for the PosixTransport:
`void FilePOSIX::Flush() {}`
…On Thu, Sep 19, 2019 at 2:29 PM lwan86 ***@***.***> wrote:
@philip-davis <https://github.com/philip-davis> I see what you are
saying. It's actually the code from line 215 to 233:
m_FileMetadataIndexManager.OpenFiles(
metadataIndexFileNames, m_OpenMode, m_IO.m_TransportsParameters,
m_BP4Serializer.m_Profiler.m_IsActive);
if (m_OpenMode != Mode::Append ||
m_FileMetadataIndexManager.GetFileSize(0) == 0)
{
/* Prepare header and write now to Index Table indicating
* the start of streaming */
m_BP4Serializer.MakeHeader(m_BP4Serializer.m_MetadataIndex,
"Index Table", true);
m_FileMetadataIndexManager.WriteFiles(
m_BP4Serializer.m_MetadataIndex.m_Buffer.data(),
m_BP4Serializer.m_MetadataIndex.m_Position);
m_FileMetadataIndexManager.FlushFiles();
/* clear the metadata index buffer*/
m_BP4Serializer.ResetBuffer(m_BP4Serializer.m_MetadataIndex, true);
}
Basically, you are saying between "OpenFiles" and "FlushFiles", if the
reader tries to read the metadata index file, error will occur since it is
empty, right?
I think Norbert probably added some code on the reader side that handles
this situation. But I need to double check with him.
—
You are receiving this because you were mentioned.
Reply to this email directly, view it on GitHub
<#1765?email_source=notifications&email_token=ABRSFIYALLJLBW35ISI4LDTQKPAG7A5CNFSM4IYNPH62YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD7EM37I#issuecomment-533253629>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/ABRSFI5NWLFTHPM3XZBLOTLQKPAG7ANCNFSM4IYNPH6Q>
.
|
@philip-davis I think this situation has been handled on the reader side. Take a look at the code from line 255 to 267 in BP4Reader.cpp: If the size of the metadata index file is less than 63 bytes, which means the header has not been written, the reader will be just waiting. |
@philip-davis I think what @lwan86 explained addresses the concern. The Flush() function is virtual and it has actual value for buffered I/O backends, |
@lwan86, thanks for pointing that out; I didn't see that code. I think it would help with the situation, except that I can't see that it is updating |
InitBuffer() sets that variable right after OpenFiles(). Openfiles() will
avoid an exception only if the file size is > 63 bytes. So this is not the
reason for the bug.
…On Thu, Sep 19, 2019, 3:11 PM Philip Davis ***@***.***> wrote:
@lwan86 <https://github.com/lwan86>, thanks for pointing that out; I
didn't see that code. I think it would help with the situation, except that
I can't see that it is updating m_MDIndexFileProcessedSize, unless I am
missing it. As long as m_MDIndexFileProcessedSize isn't updated when the
size changes, I think this issue is still present.
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#1765?email_source=notifications&email_token=AAYYYLP5XDEFASTW5XH2MULQKPFFXA5CNFSM4IYNPH62YY3PNVWWK3TUL52HS4DFVREXG43VMVBW63LNMVXHJKTDN5WW2ZLOORPWSZGOD7EQR4A#issuecomment-533268720>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AAYYYLKFOTGDWR5H77VC4MLQKPFFXANCNFSM4IYNPH6Q>
.
|
@pnorbert Okay, I see that now. So we always have the idx header by the time we leave the IO::Open(). Thanks for the clarification! |
The open() system call can create the file on disk immediately. To demonstrate this, log into two different login nodes on, for example, Cori and run this code on one of them:
abc.txt will be visible on the other login node immediately, both for NFS and Lustre file systems.
The reason I bring this up is that this open() call is the same as the one used by FilePOSIX::Open() when openMode is Mode::Write. If I'm reading the code correctly, FilePOSIX is the default Transport library, including for BP4. This matters for the File Metadata Index. Even though we wait until after writing the header to do a flush, that flush is a no-op for POSIX, and the zero-length file has already been created.
This is, in turn, a problem because the reader could see the zero-length metadata index file, and then interpreting the header as a new timestep. I expect ParseMetadata would fail noisily in this case, but I haven't looked closely enough at the code to know that for sure.
The text was updated successfully, but these errors were encountered: