-
-
Notifications
You must be signed in to change notification settings - Fork 183
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Triage all proposed metrics (396 of 396 done) #33
Comments
Copy will do this
…On Tue, Jun 4, 2019, 11:05 Rick Viscomi ***@***.***> wrote:
Assigned #33
<#33> to
@tjmonsi <https://github.com/tjmonsi>.
—
You are receiving this because you were assigned.
Reply to this email directly, view it on GitHub
<#33?email_source=notifications&email_token=AAUF5VV6YYNDFWW4VGWNSDDPYXLVVA5CNFSM4HSXT6X2YY3PNVWWK3TUL52HS4DFWZEXG43VMVCXMZLOORHG65DJMZUWGYLUNFXW5KTDN5WW2ZLOORPWSZGORZBXJOI#event-2386785465>,
or mute the thread
<https://github.com/notifications/unsubscribe-auth/AAUF5VUMYMD5VSPLUTFXMNDPYXLVVANCNFSM4HSXT6XQ>
.
|
@HTTPArchive/data-analysts reminder to please go through the Metrics Triage sheet when you have the time. There was a lot of info in the first post so here's a condensed version:
The next step will be to start writing queries and custom metrics using the HTTP Archive forum to discuss solutions. |
I understand we can create Once I understand the custom metrics capabilities, I could fill out the Metrics Triage sheet. |
Good question! Custom metrics are JS snippets you can execute on each page. They are run by our legacy crawl system and the code for existing metrics is here: https://github.com/HTTPArchive/legacy.httparchive.org/tree/master/custom_metrics For example, see the doctype custom metric. To test it, you can run it directly on webpagetest.org under the "Custom" tab: Note that all WPT custom metrics must have You'll see the output in the WPT results: For complex metrics like almanac.js you will need to inspect the JSON results directly to see the output. The test ID for the results is in the URL. Simply append Then you can find the custom metric by going to Let me know if you have any other questions! |
Sorry if I missed this somewhere, but do we need to do something extra to get the right permissions to query the sample datasets created in #34 and/or have our test queries not billed to us individually? :) |
I've updated the permissions of the sample_data dataset so anyone can query it. The goal for that dataset is to allow @HTTPArchive/data-analysts to explore the schema and validate their queries. The table sizes should be small enough so any queries fit comfortably within the free monthly quota. When we run the analysis against the full dataset, I hope to have BQ credits for everyone to cover any expenses. |
@HTTPArchive/data-analysts we're behind on triaging all of the metrics so I think we need to take a different approach. There are 350 metrics and 12 analysts, so that's an average of 30 metrics per analyst. If we divide and conquer that way, we should be able to meet the July 1 deadline. I'll go through the triage sheet and assign each analyst to approximately 30 metrics each grouped by chapter. I'll update this issue with a table of the assignments. I've updated the sheet with Analyst assignments and updated the summary table with each analyst's total metric status. @khempenius and @patrickhulce since you're both authors and expressed interest only in taking on analyst roles for your respective chapters, I didn't add you to any new chapters. @fhoffa I coaxed you into this so I didn't give you too many metrics to work on. Let me know if any of you are willing to take on more metrics, it'd be a big help. @beouss you expressed an interest in joining the team but never accepted your invitation. If you're still interested I'll assign you some metrics. |
Today's the day! I've marked all 5 remaining I'll be syncing the custom metrics with the HTTP Archive server today so they're included in tomorrow's July crawl. |
Assigned: @HTTPArchive/data-analysts team
Due date: No later than July 1
Any metrics that require augmenting the test infrastructure (eg custom metrics) must be ready to go when the July crawl starts. This ensures that when the crawl completes at the end of July, we can query the dataset and pass it off to authors for interpretation in August.
As of now there are 350+ metrics spread over 20 chapters.
I've copied all of the metrics for each chapter to this sheet (named "Metrics Triage"). To edit the sheet please give me your email address to add to the editors list. What we need to do is go through the list of metrics for each chapter and assign a status from one of the following:
The lifecycle is:
Custom metrics should only be added as a last resort and must adhere to strict performance requirements. We test on millions of pages so any complex/slow scripts would impede the crawl. Because we anticipate needing many custom metrics, we'll implement everything as individual functions within a single custom metric whose output is a JSON-encoded object with each result as its own sub-property. More on this when we get there.
Add your name in the
Analyst
column to take responsibility for moving it through the metric lifecycle.Once we're ready to begin writing queries, we will create a thread on https://discuss.httparchive.org for each chapter, listing all queryable metrics. Hopefully we can crowdsource some of the querying by tapping into the power users on the forum.
The text was updated successfully, but these errors were encountered: