-
Notifications
You must be signed in to change notification settings - Fork 242
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Profiling tool support for collection and analysis (#2590)
* Qualification tool Signed-off-by: Thomas Graves <[email protected]> * remove unused func * Add missing files * Add checks for format option * cast columsn to string to write to text * Revert "Add checks for format option" This reverts commit 6f5271c. * cleanup Signed-off-by: Thomas Graves <[email protected]> * update output dir * formating * Update help messages * update app name * cleanup * put test functions back * fix typo * add printSQLPlanMetrics and printRapidsJar * use opt * Add Analysis * format output * more tests Signed-off-by: Thomas Graves <[email protected]> * tests working * test rearrange utils * move test file * move test file right location * add Analysis Suite * update test analysis * add rapids-4-spark-tools/src/test/resources/ProfilingExpectations/rapids_join_eventlog_jobandstagemetrics_expectation.csv * add more tests * more tests Signed-off-by: Thomas Graves <[email protected]> * remove unneeded expectation file * Add more analysis tests * comment * cleanup * fix logging include
- Loading branch information
Showing
17 changed files
with
1,282 additions
and
69 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
6 changes: 6 additions & 0 deletions
6
.../resources/ProfilingExpectations/rapids_join_eventlog_jobandstagemetrics2_expectation.csv
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,6 @@ | ||
appIndex,ID,numTasks,Duration,diskBytesSpilled_sum,duration_sum,duration_max,duration_min,duration_avg,executorCPUTime_sum,executorDeserializeCPUTime_sum,executorDeserializeTime_sum,executorRunTime_sum,gettingResultTime_sum,input_bytesRead_sum,input_recordsRead_sum,jvmGCTime_sum,memoryBytesSpilled_sum,output_bytesWritten_sum,output_recordsWritten_sum,peakExecutionMemory_max,resultSerializationTime_sum,resultSize_max,sr_fetchWaitTime_sum,sr_localBlocksFetched_sum,sr_localBytesRead_sum,sr_remoteBlocksFetched_sum,sr_remoteBytesRead_sum,sr_remoteBytesReadToDisk_sum,sr_totalBytesRead_sum,sw_bytesWritten_sum,sw_recordsWritten_sum,sw_writeTime_sum | ||
1,job_0,213,2515,0,25761,1624,9,120.9,7045,3021,11178,13522,0,0,0,424,0,0,0,0,10,8075,0,2600,80279920,0,0,0,80279920,80279920,2600,849 | ||
1,stage_0,6,1761,0,9455,1624,1540,1575.8,2914,1283,5056,4248,0,0,0,228,0,0,0,0,3,2951,0,0,0,0,0,0,0,40132263,1200,373 | ||
1,stage_1,6,1666,0,9274,1621,1528,1545.7,2568,1004,5016,4099,0,0,0,196,0,0,0,0,4,2951,0,0,0,0,0,0,0,40132257,1200,473 | ||
1,stage_2,200,592,0,6937,221,9,34.7,1518,696,1065,5125,0,0,0,0,0,0,0,0,3,7402,0,2400,80264520,0,0,0,80264520,15400,200,3 | ||
1,stage_3,1,101,0,95,95,95,95.0,45,38,41,50,0,0,0,0,0,0,0,0,0,8075,0,200,15400,0,0,0,15400,0,0,0 |
6 changes: 6 additions & 0 deletions
6
...t/resources/ProfilingExpectations/rapids_join_eventlog_jobandstagemetrics_expectation.csv
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,6 @@ | ||
appIndex,ID,numTasks,Duration,diskBytesSpilled_sum,duration_sum,duration_max,duration_min,duration_avg,executorCPUTime_sum,executorDeserializeCPUTime_sum,executorDeserializeTime_sum,executorRunTime_sum,gettingResultTime_sum,input_bytesRead_sum,input_recordsRead_sum,jvmGCTime_sum,memoryBytesSpilled_sum,output_bytesWritten_sum,output_recordsWritten_sum,peakExecutionMemory_max,resultSerializationTime_sum,resultSize_max,sr_fetchWaitTime_sum,sr_localBlocksFetched_sum,sr_localBytesRead_sum,sr_remoteBlocksFetched_sum,sr_remoteBytesRead_sum,sr_remoteBytesReadToDisk_sum,sr_totalBytesRead_sum,sw_bytesWritten_sum,sw_recordsWritten_sum,sw_writeTime_sum | ||
1,job_0,213,2569,0,26735,1598,10,125.5,6500,3433,12095,13414,0,0,0,336,0,0,0,0,8,8075,0,2600,80279908,0,0,0,80279908,80279908,2600,944 | ||
1,stage_0,6,1743,0,9518,1598,1580,1586.3,2509,1391,5309,4043,0,0,0,168,0,0,0,0,3,2951,0,0,0,0,0,0,0,40132250,1200,397 | ||
1,stage_1,6,1631,0,9434,1582,1568,1572.3,2405,1065,5273,3998,0,0,0,168,0,0,0,0,5,2951,0,0,0,0,0,0,0,40132258,1200,505 | ||
1,stage_2,200,688,0,7705,237,10,38.5,1556,940,1474,5337,0,0,0,0,0,0,0,0,0,7359,0,2400,80264508,0,0,0,80264508,15400,200,42 | ||
1,stage_3,1,83,0,78,78,78,78.0,30,37,39,36,0,0,0,0,0,0,0,0,0,8075,0,200,15400,0,0,0,15400,0,0,0 |
11 changes: 11 additions & 0 deletions
11
...ources/ProfilingExpectations/rapids_join_eventlog_jobandstagemetricsmulti_expectation.csv
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,11 @@ | ||
appIndex,ID,numTasks,Duration,diskBytesSpilled_sum,duration_sum,duration_max,duration_min,duration_avg,executorCPUTime_sum,executorDeserializeCPUTime_sum,executorDeserializeTime_sum,executorRunTime_sum,gettingResultTime_sum,input_bytesRead_sum,input_recordsRead_sum,jvmGCTime_sum,memoryBytesSpilled_sum,output_bytesWritten_sum,output_recordsWritten_sum,peakExecutionMemory_max,resultSerializationTime_sum,resultSize_max,sr_fetchWaitTime_sum,sr_localBlocksFetched_sum,sr_localBytesRead_sum,sr_remoteBlocksFetched_sum,sr_remoteBytesRead_sum,sr_remoteBytesReadToDisk_sum,sr_totalBytesRead_sum,sw_bytesWritten_sum,sw_recordsWritten_sum,sw_writeTime_sum | ||
1,job_0,213,2569,0,26735,1598,10,125.5,6500,3433,12095,13414,0,0,0,336,0,0,0,0,8,8075,0,2600,80279908,0,0,0,80279908,80279908,2600,944 | ||
1,stage_0,6,1743,0,9518,1598,1580,1586.3,2509,1391,5309,4043,0,0,0,168,0,0,0,0,3,2951,0,0,0,0,0,0,0,40132250,1200,397 | ||
1,stage_1,6,1631,0,9434,1582,1568,1572.3,2405,1065,5273,3998,0,0,0,168,0,0,0,0,5,2951,0,0,0,0,0,0,0,40132258,1200,505 | ||
1,stage_2,200,688,0,7705,237,10,38.5,1556,940,1474,5337,0,0,0,0,0,0,0,0,0,7359,0,2400,80264508,0,0,0,80264508,15400,200,42 | ||
1,stage_3,1,83,0,78,78,78,78.0,30,37,39,36,0,0,0,0,0,0,0,0,0,8075,0,200,15400,0,0,0,15400,0,0,0 | ||
2,job_0,213,2515,0,25761,1624,9,120.9,7045,3021,11178,13522,0,0,0,424,0,0,0,0,10,8075,0,2600,80279920,0,0,0,80279920,80279920,2600,849 | ||
2,stage_0,6,1761,0,9455,1624,1540,1575.8,2914,1283,5056,4248,0,0,0,228,0,0,0,0,3,2951,0,0,0,0,0,0,0,40132263,1200,373 | ||
2,stage_1,6,1666,0,9274,1621,1528,1545.7,2568,1004,5016,4099,0,0,0,196,0,0,0,0,4,2951,0,0,0,0,0,0,0,40132257,1200,473 | ||
2,stage_2,200,592,0,6937,221,9,34.7,1518,696,1065,5125,0,0,0,0,0,0,0,0,3,7402,0,2400,80264520,0,0,0,80264520,15400,200,3 | ||
2,stage_3,1,101,0,95,95,95,95.0,45,38,41,50,0,0,0,0,0,0,0,0,0,8075,0,200,15400,0,0,0,15400,0,0,0 |
84 changes: 84 additions & 0 deletions
84
.../src/test/resources/ProfilingExpectations/rapids_join_eventlog_sqlmetrics_expectation.csv
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,84 @@ | ||
sqlID,nodeID,nodeName,accumulatorId,name,max_value,metricType | ||
0,0,GpuColumnarToRow,33,total time,857404,nsTiming | ||
0,1,GpuHashAggregate,34,output rows,1,sum | ||
0,1,GpuHashAggregate,35,output columnar batches,1,sum | ||
0,1,GpuHashAggregate,36,total time,4212819,nsTiming | ||
0,1,GpuHashAggregate,37,aggregation time,3846803,nsTiming | ||
0,2,GpuShuffleCoalesce,39,output rows,200,sum | ||
0,2,GpuShuffleCoalesce,40,output columnar batches,1,sum | ||
0,2,GpuShuffleCoalesce,41,total time,3803240,nsTiming | ||
0,2,GpuShuffleCoalesce,42,collect batch time,3277904,nsTiming | ||
0,2,GpuShuffleCoalesce,43,concat batch time,392509,nsTiming | ||
0,3,GpuColumnarExchange,44,partition data size,16000,sum | ||
0,3,GpuColumnarExchange,45,partitions,1,sum | ||
0,3,GpuColumnarExchange,46,output rows,200,sum | ||
0,3,GpuColumnarExchange,47,output columnar batches,200,sum | ||
0,3,GpuColumnarExchange,48,data size,19600,size | ||
0,3,GpuColumnarExchange,50,local blocks read,200,sum | ||
0,3,GpuColumnarExchange,53,local bytes read,15400,size | ||
0,3,GpuColumnarExchange,54,fetch wait time,0,timing | ||
0,3,GpuColumnarExchange,55,records read,200,sum | ||
0,3,GpuColumnarExchange,56,shuffle bytes written,15400,size | ||
0,3,GpuColumnarExchange,57,shuffle records written,200,sum | ||
0,3,GpuColumnarExchange,58,shuffle write time,93193331,nsTiming | ||
0,4,GpuHashAggregate,59,output rows,200,sum | ||
0,4,GpuHashAggregate,60,output columnar batches,200,sum | ||
0,4,GpuHashAggregate,61,total time,80781515,nsTiming | ||
0,4,GpuHashAggregate,62,aggregation time,31923387,nsTiming | ||
0,5,GpuProject,64,total time,5377158,nsTiming | ||
0,6,GpuShuffledHashJoin,65,output rows,10000000,sum | ||
0,6,GpuShuffledHashJoin,66,output columnar batches,200,sum | ||
0,6,GpuShuffledHashJoin,67,total time,3904332009,nsTiming | ||
0,6,GpuShuffledHashJoin,68,build side size,80000000,size | ||
0,6,GpuShuffledHashJoin,69,build time,3448606506,nsTiming | ||
0,6,GpuShuffledHashJoin,70,stream time,260796041,nsTiming | ||
0,6,GpuShuffledHashJoin,71,join time,178084313,nsTiming | ||
0,6,GpuShuffledHashJoin,72,join output rows,10000000,sum | ||
0,7,GpuShuffleCoalesce,74,output rows,10000000,sum | ||
0,7,GpuShuffleCoalesce,75,output columnar batches,200,sum | ||
0,7,GpuShuffleCoalesce,76,total time,261389422,nsTiming | ||
0,7,GpuShuffleCoalesce,77,collect batch time,167775821,nsTiming | ||
0,7,GpuShuffleCoalesce,78,concat batch time,83550919,nsTiming | ||
0,8,GpuColumnarExchange,79,partition data size,42872100,sum | ||
0,8,GpuColumnarExchange,80,partitions,200,sum | ||
0,8,GpuColumnarExchange,81,output rows,10000000,sum | ||
0,8,GpuColumnarExchange,82,output columnar batches,1200,sum | ||
0,8,GpuColumnarExchange,83,data size,40076192,size | ||
0,8,GpuColumnarExchange,85,local blocks read,1200,sum | ||
0,8,GpuColumnarExchange,88,local bytes read,40132258,size | ||
0,8,GpuColumnarExchange,89,fetch wait time,0,timing | ||
0,8,GpuColumnarExchange,90,records read,1200,sum | ||
0,8,GpuColumnarExchange,91,shuffle bytes written,40132258,size | ||
0,8,GpuColumnarExchange,92,shuffle records written,1200,sum | ||
0,8,GpuColumnarExchange,93,shuffle write time,508750471,nsTiming | ||
0,9,GpuProject,94,total time,6667140,nsTiming | ||
0,10,GpuRowToColumnar,95,total time,61112304,nsTiming | ||
0,11,WholeStageCodegen (1),96,duration,5463,timing | ||
0,13,Scan,97,number of output rows,10000000,sum | ||
0,14,GpuCoalesceBatches,98,output rows,10000000,sum | ||
0,14,GpuCoalesceBatches,99,output columnar batches,200,sum | ||
0,14,GpuCoalesceBatches,100,total time,3383354389,nsTiming | ||
0,14,GpuCoalesceBatches,101,collect batch time,3275108263,nsTiming | ||
0,14,GpuCoalesceBatches,102,concat batch time,20312708,nsTiming | ||
0,14,GpuCoalesceBatches,103,peak device memory,80000000,size | ||
0,15,GpuShuffleCoalesce,107,output rows,10000000,sum | ||
0,15,GpuShuffleCoalesce,108,output columnar batches,200,sum | ||
0,15,GpuShuffleCoalesce,109,total time,3266208420,nsTiming | ||
0,15,GpuShuffleCoalesce,110,collect batch time,359397047,nsTiming | ||
0,15,GpuShuffleCoalesce,111,concat batch time,104974316,nsTiming | ||
0,16,GpuColumnarExchange,112,partition data size,42872100,sum | ||
0,16,GpuColumnarExchange,113,partitions,200,sum | ||
0,16,GpuColumnarExchange,114,output rows,10000000,sum | ||
0,16,GpuColumnarExchange,115,output columnar batches,1200,sum | ||
0,16,GpuColumnarExchange,116,data size,40076192,size | ||
0,16,GpuColumnarExchange,118,local blocks read,1200,sum | ||
0,16,GpuColumnarExchange,121,local bytes read,40132250,size | ||
0,16,GpuColumnarExchange,122,fetch wait time,0,timing | ||
0,16,GpuColumnarExchange,123,records read,1200,sum | ||
0,16,GpuColumnarExchange,124,shuffle bytes written,40132250,size | ||
0,16,GpuColumnarExchange,125,shuffle records written,1200,sum | ||
0,16,GpuColumnarExchange,126,shuffle write time,400284505,nsTiming | ||
0,17,GpuProject,127,total time,207820,nsTiming | ||
0,18,GpuRowToColumnar,128,total time,58640462,nsTiming | ||
0,19,WholeStageCodegen (2),129,duration,5920,timing | ||
0,21,Scan,130,number of output rows,10000000,sum |
Oops, something went wrong.