Docs blocks (#810) #888

beckjake · 2018-07-31T19:57:39Z

Add support for docs blocks and support for a doc() Jinja macro that references them.

…rsing

…late

…esystem

drewbanin

I really love this. Minimal comments after my first pass through the code -- everything in here looks super good. I'll have some more questions I think after playing around with this in concert with schema yml v2

drewbanin · 2018-08-01T01:02:39Z

dbt/contracts/graph/parsed.py

+    def find_docs_by_name(self, name, package=None):
+        for unique_id, doc in self.docs.items():
+            parts = unique_id.split('.')
+            if len(parts) != 2:


am i crazy or should parts have 3 items here? Something like docs.project_name.docs_name?

drewbanin · 2018-08-01T01:11:30Z

dbt/parser/docs.py

+class DocumentationParser(BaseParser):
+    @classmethod
+    def load_file(cls, package_name, root_dir, relative_dirs):
+        """Load and parse documentation in a lsit of projects. Returns a list


lil typo here

drewbanin · 2018-08-01T01:24:34Z

dbt/parser/docs.py

+
+            # because docs are in their own graph namespace, node type doesn't
+            # need to be part of the unique ID.
+            unique_id = '{}.{}'.format(docfile.package_name, name)


ah! I see. Is there any benefit to keeping the format of these unique ids consistent with the rest of dbt's unique ids?

I don't think so. In the nodes and macros dicts, there are potentially multiple node types so the .-separated namespacing by type makes a lot of sense. In the docs dict (and, I hope, all future manifest members) there's only one type. I would have removed resource_type from the docs entirely if I could have. Docs can only be referenced via doc() and created via {% docs ... %}, and nothing else can be referenced that way, so you never have any ambiguity (like you might with seeds vs models and ref).

That said, if there are strong feelings about consistency, it's a pretty easy change.

ok, I buy that

drewbanin · 2018-08-01T01:26:38Z

dbt/parser/util.py

+            return manifest.find_docs_by_name(target_doc_name,
+                                              target_doc_package)
+
+        candidate_targets = [current_project, node_package, None]


this is phenomenal

drewbanin · 2018-08-02T13:21:23Z

dbt/loader.py

+            root_project=root_project,
+            all_projects=all_projects,
+            root_dir=project.get('project-root'),
+            relative_dirs=project.get('source-paths', []))


@beckjake this code looks for the docs files in the source-paths directory. I think that's a pretty reasonable default. Do you think there's any merit to making an (optional) docs-paths dir as well? I can imagine this would be useful for docs that don't pertain to a specific model, eg. docs for widely-used columns, or the "overview" markdown block, for instance.

cc @cmcarthur

Hmm, I didn't even consider that. I just immediately put it next to the schema.yml. But it does make some sense.

If we did so maybe we should consider something like: relative_dirs=project.get('source-paths', []) + project.get('docs-paths', [])

I actually also wouldn't be against also searching from the root directory, if that's feasible with dbt (maybe relative_dirs + ['.'] would work?).

@beckjake I was thinking the same thing around searching the root directory! I'm a little concerned thought that a random .md file could contain code that would break dbt.

Just as an example, a styleguide.md file (like this might contain {{ ref(...) }} in example sql, and I think the docs parser would probably choke on that, right?

After some testing, we do a recursive search through the dirs, so including . in the list of paths will also snag the target/ directory, which is probably not what we want here.

I think the right way to handle this is to just use docs-paths, where the default value is:

"docs-paths": ["models"]

This will let users override the dirs in their dbt_project.yml file, but will still be sort of a sane default value. You buy that?

I mostly agree, but I think we should default docs-paths to the contents of source-paths, rather than ["models"], if we can. It's a subtle difference, but the idea of "by default we search your source paths" seems really compelling to me.

cool, I buy that. Let's make sure to document that functionality in the docs for the docs :)

drewbanin · 2018-08-02T13:46:19Z

dbt/parser/schemas.py

@@ -403,11 +407,16 @@ def parse_model(cls, model, package_name, root_dir, path, root_project,
                                  all_projects, macros)
            yield 'test', node

+        context = {'doc': dbt.context.parser.docs(model, docrefs)}
+        description = model.get('description')


It looks like if the model doesn't have a description, we pass None into a ParsedNodePatch below, resulting in a validation error. Can we use an empty string default here?

File "/Users/drew/fishtown/dbt/dbt/parser/schemas.py", line 448, in load_and_parse for result_type, node in v2_results: File "/Users/drew/fishtown/dbt/dbt/parser/schemas.py", line 364, in parse_v2_yml for node_type, node in iterator: File "/Users/drew/fishtown/dbt/dbt/parser/schemas.py", line 416, in parse_model patch = ParsedNodePatch( File "/Users/drew/fishtown/dbt/dbt/api/object.py", line 37, in __init__ self.validate() File "/Users/drew/fishtown/dbt/dbt/api/object.py", line 85, in validate raise ValidationException(msg) dbt.exceptions.ValidationException: Runtime Error Invalid arguments passed to "ParsedNodePatch" instance: description.None is not of type 'string'

Oh, yes - I thought I got all of these, I'll see if I left any more.

drewbanin

Ship it!

Jacob Beck added 6 commits July 31, 2018 13:49

initial docs extension work

c67924f

More work on docs blocks, added some rudimentary unit tests around pa…

a4b6048

…rsing

attach the full docs block to ParsedDocumentation instead of the temp…

1123f7e

…late

Add tests, wire up more things

56b7aac

Add a test for late-binding views that has been hanging out on my fil…

f44a512

…esystem

Integration test -> many bug fixes

4e57b17

beckjake requested a review from drewbanin July 31, 2018 19:57

drewbanin reviewed Aug 1, 2018

View reviewed changes

PR fixes, changelog update

777510e

drewbanin reviewed Aug 2, 2018

View reviewed changes

Make a docs-paths, if unset it defaults to source-paths

9767d11

beckjake force-pushed the docs-blocks branch from 9f9baf2 to 9767d11 Compare August 2, 2018 14:26

drewbanin approved these changes Aug 2, 2018

View reviewed changes

beckjake merged commit 4a9e3ee into dev/isaac-asimov Aug 2, 2018

beckjake deleted the docs-blocks branch August 6, 2018 12:45

This was referenced Aug 7, 2018

Docs blocks #810

Closed

integration test for late binding views on redshift #862

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Docs blocks (#810) #888

Docs blocks (#810) #888

beckjake commented Jul 31, 2018

drewbanin left a comment

drewbanin Aug 1, 2018

drewbanin Aug 1, 2018

drewbanin Aug 1, 2018

beckjake Aug 1, 2018

drewbanin Aug 1, 2018

drewbanin Aug 1, 2018

drewbanin Aug 2, 2018

beckjake Aug 2, 2018 •

edited

Loading

drewbanin Aug 2, 2018

beckjake Aug 2, 2018

drewbanin Aug 2, 2018 •

edited

Loading

drewbanin Aug 2, 2018 •

edited

Loading

beckjake Aug 2, 2018

drewbanin left a comment

Docs blocks (#810) #888

Docs blocks (#810) #888

Conversation

beckjake commented Jul 31, 2018

drewbanin left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

beckjake Aug 2, 2018 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

drewbanin Aug 2, 2018 • edited Loading

Choose a reason for hiding this comment

drewbanin Aug 2, 2018 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

drewbanin left a comment

Choose a reason for hiding this comment

beckjake Aug 2, 2018 •

edited

Loading

drewbanin Aug 2, 2018 •

edited

Loading

drewbanin Aug 2, 2018 •

edited

Loading