
HADOOP-13327 Output Stream Specification. #2587

Conversation

steveloughran
Contributor

Specification of OutputStream and Syncable

with

  • RawLocalFileSystem to implement Syncable
  • Consistent use of StreamCapabilities everywhere

This is a rebase of #2102. Because the RawLocalOutputStream is now wrapped by BufferedIOStatisticsOutputStream, which passes through stream capabilities and the Syncable API, the tests which were failing there should now work.

@steveloughran
Contributor Author

  • does not address final comments in that review
  • no ITest runs (I don't think it needs them; will look again)

@steveloughran steveloughran force-pushed the HADOOP-13327-outputstream-and-syncable branch from 2e520ca to f54d260 Compare January 21, 2021 11:51
@bgaborg bgaborg self-requested a review January 21, 2021 14:43
@joshelser
Member

Tagging a few HBase folks who may be interested in this as well, @ndimiduk @saintstack @Apache9 @busbey given the previous work on apache/hbase#1408 and apache/hbase#1597.

Steve had mentioned to me that he was thinking of HBase and us being able to more safely put WALs onto file:// with these changes (not having to just disable the checks with LocalFileSystem).

@joshelser joshelser left a comment

All makes sense to me. Tried my best to read (and not just skim) outputstream.md. Excellent work. Thanks for bringing to my attention, Steve!

@steveloughran
Contributor Author

Steve had mentioned to me that he was thinking of HBase and us being able to more safely put WALs onto file:// with these changes (not having to just disable the checks with LocalFileSystem).

you are still going to need some robust storage, RAID-1+ or similar, but we are getting sync all the way through, and you can query the streams to make sure they say they support it, including on the local FS.
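That "sync all the way through" on the local filesystem ultimately bottoms out in an OS-level fsync: RawLocalFileSystem's hsync() ends up invoking FileDescriptor.sync() on the underlying file. A JDK-only sketch of that final step (no Hadoop dependency; the class and file names here are illustrative, not from the patch):

```java
import java.io.File;
import java.io.FileOutputStream;
import java.io.IOException;

public class LocalSyncDemo {

    /**
     * Write bytes and force them to the storage device. This mirrors what
     * a local-FS hsync() ultimately does: flush the stream's user-space
     * buffers to the OS, then FileDescriptor.sync() to block until the
     * OS reports the data durable.
     */
    static long writeAndSync(File target, byte[] data) throws IOException {
        try (FileOutputStream out = new FileOutputStream(target)) {
            out.write(data);
            out.flush();          // push user-space buffers to the OS
            out.getFD().sync();   // fsync: ask the OS to persist to the device
        }
        return target.length();
    }

    public static void main(String[] args) throws IOException {
        File f = File.createTempFile("syncdemo", ".bin");
        f.deleteOnExit();
        long len = writeAndSync(f, new byte[]{'h', 'i'});
        System.out.println("synced " + len + " bytes");
    }
}
```

Note that durability still depends on the hardware honouring the sync, which is why robust storage (RAID-1+ or similar) remains necessary underneath.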

@steveloughran
Contributor Author

checkstyle nits

./hadoop-common-project/hadoop-common/src/test/java/org/apache/hadoop/fs/contract/AbstractContractCreateTest.java:512:          if (c == -1) {:24: Must have at least one statement. [EmptyBlock]
./hadoop-hdfs-project/hadoop-hdfs-client/src/main/java/org/apache/hadoop/hdfs/DFSOutputStream.java:70:import org.apache.hadoop.util.StringUtils;:8: Unused import - org.apache.hadoop.util.StringUtils. [UnusedImports]
./hadoop-tools/hadoop-azure/src/main/java/org/apache/hadoop/fs/azure/BlockBlobAppendStream.java:30:import java.util.Locale;:8: Unused import - java.util.Locale. [UnusedImports]
./hadoop-tools/hadoop-azure/src/main/java/org/apache/hadoop/fs/azure/SyncableDataOutputStream.java:31:import org.apache.hadoop.classification.InterfaceAudience;:1: Duplicate import to line 28 - org.apache.hadoop.classification.InterfaceAudience. [RedundantImport]

@joshelser
Member

you are still going to need some robust storage, RAID-1+ or similar, but we are getting sync all the way through, and you can query the streams to make sure they say they support it -including local fs.

Yep, for sure. This is a nice improvement.

@apache apache deleted a comment from hadoop-yetus Jan 28, 2021
@apache apache deleted a comment from hadoop-yetus Jan 28, 2021
@apache apache deleted a comment from hadoop-yetus Jan 29, 2021
@steveloughran
Contributor Author

checkstyles and whitespace. The javac changes are all from deprecating hflush capability

@mehakmeet mehakmeet left a comment

+1, pending some nits. I have already reviewed the .md in the previous PR (now closed) and was satisfied.

try {
out.hflush();
if (!supportsFlush) {
// hsync not ignored
Contributor

slightly confusing comment; maybe we can change this to "hsync tests not ignored"?

LOG.info("Expecting files under {} to have supportsSync={}"
+ " and supportsFlush={}",
path, supportsSync, supportsFlush);

Contributor

nit: blank line.

LOG.info("Successfully read synced data on a new reader {}", in);
}
} else {
// np sync. Let's do a flush and see what happens.
Contributor

nit: typo in "np" -> "no"

if (!isSupported(IS_BLOBSTORE)) {
throw e;
}
}
Contributor

Maybe add a LOG.warn() about the FileNotFoundException if it is an object store?

public LocalFSFileInputStream(Path f) throws IOException {
fis = new FileInputStream(pathToFile(f));
bytesRead = ioStatistics.getCounterReference(
Contributor

No issues here, just confirming that these changes aren't part of a different PR by mistake (the IOStats PR).

Contributor Author

this is a follow-on to IOStats... it was actually blocked on the buffer between the raw local FS output and FSDataOutput:
org.apache.hadoop.fs.statistics.BufferedIOStatisticsOutputStream, which is needed for hsync passthrough to the file output stream. We're now generating IOStats for the local FS too.

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
Contributor

unused import.

@steveloughran
Contributor Author

FWIW, here are the IOStats logged on the runs

ABFS (which supports hsync)

.dfs.core.windows.net/stevel-testing/test/testSyncable?action=flush&retainUncommittedData=false&position=2&close=true&timeout=90
2021-02-08 12:27:48,642 INFO  [JUnit-testSyncable]: contract.AbstractContractCreateTest (AbstractContractCreateTest.java:validateSyncableSemantics(586)) - IOStatistics counters=((queue_shrunk_ops=1) (time_spent_task_wait.failures=0) (write_current_buffer_ops=2) (bytes_upload_failed=0) (bytes_upload=2) (time_spent_on_put_request.failures=0) (time_spent_on_put_request=2) (time_spent_task_wait=0) (bytes_upload_successfully=2));
gauges=();
minimums=((time_spent_task_wait.failures.min=-1) (time_spent_on_put_request.min=28) (time_spent_task_wait.min=-1) (time_spent_on_put_request.failures.min=-1));
maximums=((time_spent_on_put_request.failures.max=-1) (time_spent_task_wait.max=-1) (time_spent_on_put_request.max=36) (time_spent_task_wait.failures.max=-1));
means=((time_spent_task_wait.mean=(samples=0, sum=0, mean=0.0000)) (time_spent_on_put_request.failures.mean=(samples=0, sum=0, mean=0.0000)) (time_spent_task_wait.failures.mean=(samples=0, sum=0, mean=0.0000)) (time_spent_on_put_request.mean=(samples=2, sum=64, mean=32.0000)));

s3a

2021-02-08 12:29:05,294 [JUnit-testSyncable] WARN  contract.AbstractContractCreateTest (AbstractContractCreateTest.java:validateSyncableSemantics(575)) - Output file was not created; this is an object store with different visibility semantics
2021-02-08 12:29:05,500 [JUnit-testSyncable] INFO  contract.AbstractContractCreateTest (AbstractContractCreateTest.java:validateSyncableSemantics(586)) - IOStatistics counters=((stream_write_bytes=2) (stream_write_queue_duration=0) (action_executor_acquired=1) (stream_write_block_uploads=1) (stream_write_exceptions=0) (stream_write_exceptions_completing_upload=0) (stream_write_total_time=0) (stream_write_total_data=2) (action_executor_acquired.failures=0));
gauges=((stream_write_block_uploads_data_pending=0) (stream_write_block_uploads_pending=1));
minimums=((action_executor_acquired.failures.min=-1) (action_executor_acquired.min=0));
maximums=((action_executor_acquired.max=0) (action_executor_acquired.failures.max=-1));
means=((action_executor_acquired.failures.mean=(samples=0, sum=0, mean=0.0000)) (action_executor_acquired.mean=(samples=1, sum=0, mean=0.0000)));

@mukund-thakur mukund-thakur left a comment

LGTM +1, pending some nits.
Awesome work on the docs. @Steve

* Probe for an input stream having a capability; returns true
* if the stream implements {@link StreamCapabilities} and its
* {@code hasCapabilities()} method returns true for the capability.
* @param out output stream
Contributor

input stream

## Output Stream Model

For this specification, an output stream can be viewed as a list of bytes
stored in the client -the `hsync()` and `hflush()` operations the actions
Contributor

are the actions?

FS'.Files(path) == buffer
```

Any client reading the data at the path MUST see the new data.
Contributor

As per the javadoc on hsync and hflush, they are about guarantees that data is actually written to disk or final storage. So just a question here: how is it ensured that the data will be visible to readers after the close call?

Contributor Author

up to the implementation. Close() must pass all its data to the shared FS.

Now, if you want some fun, look at NFS client-side caching. It dates from the era of diskless Sun workstations and was optimised for short-lived files which would only be used by the workstation, so copying over a 1 MB/s shared Ethernet to an even slower shared HDD would hurt the rest of the cluster.


Forwarding this to a full flush across a distributed filesystem, or worse,
a distant object store, is very inefficient.
Filesystem clients which do uprate a `flush()` to an `hflush()` will eventually
Contributor

update

Contributor Author

I went with "upgrade" in the end

Follow-on calls to `close()` are ignored, and calls to other methods
rejected. That is: caller's cannot be expected to call `close()` repeatedly
until it succeeds.
1. The duration of the `call()` operation is undefined. Operations which rely
Contributor

close() not call()?

it MAY be durable. That is: it does not have to be persisted, merely guaranteed
to be consistently visible to all clients attempting to open a new stream reading
data at the path.
1. `Syncable.hsync()` MUST transmit the data as per `hflush` the data and persist
Contributor

remove the second "the data" ?

to is not something everyone expects, nor convenient for any program trying
to pick up updated data in a file being written. Most visible is the length
of a file returned in the various `list` commands and `getFileStatus` —this
is often out of data.
Contributor

data or date?

Contributor Author

date. will fix.

By default, HDFS does not sync data to disk when a stream is closed; it will
be asynchronously saved to disk.

This does not mean that users do not expect it.
Contributor

This one is the best :)

This PR removes the changes related to S3A output stream lifecycle,
so only covers the specification of Syncable and ensures that StreamCapabilities
passes all the way through to the final implementation classes.

All streams which implement Syncable hsync/hflush declare this in their stream capabilities

Change-Id: I82b16a8e0965f34eb0c42504da43e8fbeabcb68c
…lities...

...even if the inner stream says "yes"

Change-Id: Ie401232a23fbc05ae40baf1700fe1cf2ab80a42a
Verify that if you call LocalFileSystem.setWriteChecksum(false)
then the streams you get back support Syncable.

This now allows applications to sync to the local FS by calling
getLocal(), disabling checksums and then creating files.

+ docs reviewed, hdfs issues moved to one place, and
"why implementors must not relay flush to hflush".

* abfs.xml declares support for hflush/hsync

Change-Id: I6f1b19a00877a26a1f3a53ce15ca2a1cff205ded
Change-Id: I3f9fa4c277d3c6975856274b898d56d1103bd742
Change-Id: I2f2fe20ab317b8836f0ba460e8d2ab95762a5a3b
Plus
* validate getFileStatus() behavior, allow for FS to not immediately update
  it
* declare that stream impls MUST implement and then reject syncable so
  that apps calling it don't get misled.

Change-Id: Id53f48d53652963e6d9f8f6e322a524c89106b27
if all is good on yetus, will merge

Change-Id: I23344abdf040649985c12222fef71e57c1ae7fcf
@steveloughran
Contributor Author

done the final comments and then rebased; unless yetus really blows up will merge in

@steveloughran steveloughran force-pushed the HADOOP-13327-outputstream-and-syncable branch from e96ba69 to adf17d0 Compare February 9, 2021 21:33
@hadoop-yetus

💔 -1 overall

Vote Subsystem Runtime Logfile Comment
+0 🆗 reexec 0m 40s Docker mode activated.
_ Prechecks _
+1 💚 dupname 0m 1s No case conflicting files found.
+0 🆗 markdownlint 0m 0s markdownlint was not available.
+1 💚 @author 0m 0s The patch does not contain any @author tags.
+1 💚 0m 0s test4tests The patch appears to include 11 new or modified test files.
_ trunk Compile Tests _
+0 🆗 mvndep 14m 1s Maven dependency ordering for branch
+1 💚 mvninstall 26m 18s trunk passed
+1 💚 compile 27m 42s trunk passed with JDK Ubuntu-11.0.9.1+1-Ubuntu-0ubuntu1.20.04
+1 💚 compile 23m 17s trunk passed with JDK Private Build-1.8.0_275-8u275-b01-0ubuntu1~20.04-b01
+1 💚 checkstyle 4m 51s trunk passed
+1 💚 mvnsite 7m 8s trunk passed
+1 💚 shadedclient 30m 6s branch has no errors when building and testing our client artifacts.
+1 💚 javadoc 5m 4s trunk passed with JDK Ubuntu-11.0.9.1+1-Ubuntu-0ubuntu1.20.04
+1 💚 javadoc 6m 11s trunk passed with JDK Private Build-1.8.0_275-8u275-b01-0ubuntu1~20.04-b01
+0 🆗 spotbugs 0m 53s Used deprecated FindBugs config; considering switching to SpotBugs.
+1 💚 findbugs 11m 36s trunk passed
_ Patch Compile Tests _
+0 🆗 mvndep 0m 48s Maven dependency ordering for patch
+1 💚 mvninstall 4m 24s the patch passed
+1 💚 compile 20m 16s the patch passed with JDK Ubuntu-11.0.9.1+1-Ubuntu-0ubuntu1.20.04
-1 ❌ javac 20m 16s /diff-compile-javac-root-jdkUbuntu-11.0.9.1+1-Ubuntu-0ubuntu1.20.04.txt root-jdkUbuntu-11.0.9.1+1-Ubuntu-0ubuntu1.20.04 with JDK Ubuntu-11.0.9.1+1-Ubuntu-0ubuntu1.20.04 generated 11 new + 2026 unchanged - 0 fixed = 2037 total (was 2026)
+1 💚 compile 18m 2s the patch passed with JDK Private Build-1.8.0_275-8u275-b01-0ubuntu1~20.04-b01
-1 ❌ javac 18m 2s /diff-compile-javac-root-jdkPrivateBuild-1.8.0_275-8u275-b01-0ubuntu1~20.04-b01.txt root-jdkPrivateBuild-1.8.0_275-8u275-b01-0ubuntu120.04-b01 with JDK Private Build-1.8.0_275-8u275-b01-0ubuntu120.04-b01 generated 11 new + 1919 unchanged - 0 fixed = 1930 total (was 1919)
-0 ⚠️ checkstyle 3m 55s /diff-checkstyle-root.txt root: The patch generated 1 new + 183 unchanged - 7 fixed = 184 total (was 190)
+1 💚 mvnsite 6m 37s the patch passed
-1 ❌ whitespace 0m 0s /whitespace-eol.txt The patch has 2 line(s) that end in whitespace. Use git apply --whitespace=fix <<patch_file>>. Refer https://git-scm.com/docs/git-apply
+1 💚 xml 0m 6s The patch has no ill-formed XML file.
+1 💚 shadedclient 13m 16s patch has no errors when building and testing our client artifacts.
+1 💚 javadoc 5m 7s the patch passed with JDK Ubuntu-11.0.9.1+1-Ubuntu-0ubuntu1.20.04
+1 💚 javadoc 6m 12s the patch passed with JDK Private Build-1.8.0_275-8u275-b01-0ubuntu1~20.04-b01
+1 💚 findbugs 12m 13s the patch passed
_ Other Tests _
+1 💚 unit 17m 16s hadoop-common in the patch passed.
+1 💚 unit 2m 37s hadoop-hdfs-client in the patch passed.
-1 ❌ unit 191m 0s /patch-unit-hadoop-hdfs-project_hadoop-hdfs.txt hadoop-hdfs in the patch passed.
+1 💚 unit 2m 16s hadoop-aws in the patch passed.
+1 💚 unit 2m 23s hadoop-azure in the patch passed.
+1 💚 unit 1m 22s hadoop-azure-datalake in the patch passed.
+1 💚 asflicense 1m 9s The patch does not generate ASF License warnings.
460m 25s
Reason Tests
Failed junit tests hadoop.hdfs.server.blockmanagement.TestUnderReplicatedBlocks
Subsystem Report/Notes
Docker ClientAPI=1.41 ServerAPI=1.41 base: https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-2587/4/artifact/out/Dockerfile
GITHUB PR #2587
Optional Tests dupname asflicense compile javac javadoc mvninstall mvnsite unit shadedclient findbugs checkstyle markdownlint xml
uname Linux 8d9eac62d9ae 4.15.0-60-generic #67-Ubuntu SMP Thu Aug 22 16:55:30 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux
Build tool maven
Personality dev-support/bin/hadoop.sh
git revision trunk / 4625616
Default Java Private Build-1.8.0_275-8u275-b01-0ubuntu1~20.04-b01
Multi-JDK versions /usr/lib/jvm/java-11-openjdk-amd64:Ubuntu-11.0.9.1+1-Ubuntu-0ubuntu1.20.04 /usr/lib/jvm/java-8-openjdk-amd64:Private Build-1.8.0_275-8u275-b01-0ubuntu1~20.04-b01
Test Results https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-2587/4/testReport/
Max. process+thread count 3757 (vs. ulimit of 5500)
modules C: hadoop-common-project/hadoop-common hadoop-hdfs-project/hadoop-hdfs-client hadoop-hdfs-project/hadoop-hdfs hadoop-tools/hadoop-aws hadoop-tools/hadoop-azure hadoop-tools/hadoop-azure-datalake U: .
Console output https://ci-hadoop.apache.org/job/hadoop-multibranch/job/PR-2587/4/console
versions git=2.25.1 maven=3.6.3 findbugs=4.0.6
Powered by Apache Yetus 0.13.0-SNAPSHOT https://yetus.apache.org

This message was automatically generated.

@steveloughran steveloughran merged commit 798df6d into apache:trunk Feb 10, 2021
asfgit pushed a commit that referenced this pull request Feb 10, 2021
This defines what output streams and especially those which implement
Syncable are meant to do, and documents where implementations (HDFS; S3)
don't. With tests.

The file:// FileSystem now supports Syncable if an application calls
FileSystem.setWriteChecksum(false) before creating a file -checksumming
and Syncable.hsync() are incompatible.

Contributed by Steve Loughran.

Change-Id: I892d768de6268f4dd6f175b3fe3b7e5bcaa91194
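The merge commit describes the application-side contract: create the stream, then verify it actually advertises sync support before trusting it with something like a WAL. A minimal stand-alone sketch of that probe-then-trust pattern follows; the StreamCapabilities interface and the "hsync" capability name mirror Hadoop's org.apache.hadoop.fs.StreamCapabilities, but the classes here are simplified stand-ins written for illustration, not the real Hadoop ones:

```java
public class CapabilityProbeDemo {

    /** Simplified stand-in for org.apache.hadoop.fs.StreamCapabilities. */
    interface StreamCapabilities {
        boolean hasCapability(String capability);
    }

    /** A stream that really persists on hsync, e.g. checksums disabled. */
    static class SyncableStream implements StreamCapabilities {
        public boolean hasCapability(String c) {
            return "hsync".equals(c) || "hflush".equals(c);
        }
    }

    /** A checksummed wrapper that cannot honour hsync. */
    static class ChecksummedStream implements StreamCapabilities {
        public boolean hasCapability(String c) {
            return false;
        }
    }

    /**
     * The pattern a durability-sensitive writer should follow: refuse to
     * proceed unless the stream it was handed declares hsync support,
     * rather than silently accepting flushes that may never reach disk.
     */
    static boolean safeForWal(StreamCapabilities out) {
        return out.hasCapability("hsync");
    }

    public static void main(String[] args) {
        System.out.println(safeForWal(new SyncableStream()));
        System.out.println(safeForWal(new ChecksummedStream()));
    }
}
```

This is why the checksum/Syncable incompatibility matters: with write checksums enabled, the wrapping checksummed stream cannot claim the capability, and the probe correctly fails.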
@ndimiduk
Member

Thank you for this effort, @steveloughran !

@steveloughran
Contributor Author

@ndimiduk this will let you run HBase against the local FS now: if you turn off the checksumming, the WAL will be synced properly...

@ndimiduk
Member

No one should do this. Only want it for stable tests.

@steveloughran
Contributor Author

No one should do this. Only want it for stable tests.

that and to challenge SQLite for dominance in the embedded-database market

jojochuang pushed a commit to jojochuang/hadoop that referenced this pull request May 23, 2023
This defines what output streams and especially those which implement
Syncable are meant to do, and documents where implementations (HDFS; S3)
don't. With tests.

The file:// FileSystem now supports Syncable if an application calls
FileSystem.setWriteChecksum(false) before creating a file -checksumming
and Syncable.hsync() are incompatible.

Contributed by Steve Loughran.

Conflicts:
	hadoop-tools/hadoop-aws/src/main/java/org/apache/hadoop/fs/s3a/impl/InternalConstants.java
	hadoop-tools/hadoop-azure/src/main/java/org/apache/hadoop/fs/azurebfs/services/AbfsOutputStream.java
	hadoop-tools/hadoop-azure/src/test/java/org/apache/hadoop/fs/azurebfs/ITestAzureBlobFileSystemFlush.java

Change-Id: I892d768de6268f4dd6f175b3fe3b7e5bcaa91194